Description

Field of the invention 

[0001] The invention is in the field of genetic disease diagnosis, tumor detection and treatment, and genetic therapy. Specifically, the invention is directed to the merlin gene, merlin protein, and the use of the gene and/or protein for (1) detecting a predisposition to develop tumors, especially neurofibromatosis-2, (2) diagnosing certain tumors, especially neurofibromatosis-2, (3) treating tumors, especially neurofibromatosis-2, (4) monitoring the course of tumor treatment, especially neurofibromatosis-2 treatment, and (5) gene replacement in affected non-tumor tissues or cells.


Background of the Invention 

[0002] Neurofibromatosis (NF) describes two major human genetic disorders that display autosomal dominant inheritance, involve tumors of the nervous system, and are distinct clinical entities (Mulvihill, J.J. et al., Ann. Intern. Med. 113:39-52 (1990)). NF1, or von Recklinghausen NF, is more common (incidence of 1/4,000) and is characterized by the highly variable expression of an array of features that include neurofibromas, cafe-au-lait macules, Lisch nodules of the iris, and a predisposition to certain malignant tumors (Riccardi, V.M. et al., N. Engl. J. Med. 305:1617-1627 (1981); Riccardi, V.M. et al., "Neurofibromatosis: Phenotype, Natural History, and Pathogenesis," Johns Hopkins Univ. Press, Baltimore, MD (1986)). It is caused by defects in a gene on chromosome 17 that has recently been isolated and characterized (Viskochil, D. et al., Cell 62:187-192 (1990); Cawthon, R.M. et al., Cell 62:193-201 (1990); Wallace, M.R. et al., Science 249:183-186 (1990)). The NF1 gene product, neurofibromin, is a large protein with a GAP-related domain and is presumably involved in modulating a signal transduction pathway whose disruption can lead to tumor formation (Ballester, R. et al., Cell 63:851-859 (1990); Buchberg, A.M. et al., Nature 347:291-294 (1990); Xu, G. et al., Cell 62:599-608 (1990); DeClue, J.E. et al., Cell 69:265-273 (1992); Basu, T.N. et al., Nature 356:713-715 (1992)).

[0003] In contrast, neurofibromatosis-2 (NF2), which occurs in about 1/40,000 live births (Evans, D.G.R. et al., J. Med. Genet. 29:841-846 (1992)), is characterized by bilateral schwannomas that develop on the vestibular branch of the 8th cranial nerve. Pressure from these tumors often causes hearing loss and vestibular symptoms in the second and third decade. Other tumors of the brain, especially meningiomas and schwannomas of other cranial nerves and spinal nerve roots (Martuza, R.L. et al., N. Engl. J. Med. 318:684-688 (1988)), and posterior capsular lens opacities (Kaiser-Kupfer, M.I. et al., Arch. Ophthalmol. 107:541-544 (1989)) are commonly present in the young affected adult.
[0004] The NF2 gene is highly penetrant. Ninety-five percent of persons with the genotype develop bilateral vestibular schwannomas. NF2 is often more severe than NF1. Teenage or early adulthood onset of multiple slow-growing tumors that can gradually cause deafness, balance disorder, paralysis, or increasing neurologic problems necessitating repeated surgical procedures characterizes NF2.

[0005] NF2 has been shown to be genetically distinct from NF1 by linkage studies that assigned the NF2 gene to chromosome 22 (Rouleau, G.A. et al., Nature 329:246-248 (1987); Wertelecki, W. et al., N. Engl. J. Med. 319:278-283 (1988); Rouleau, G.A. et al., Am. J. Hum. Genet. 46:323-328 (1990); Narod, S.A. et al., Am. J. Hum. Genet. 51:486-496 (1992)). The tumor types that occur in NF2 can be seen in the general population as solitary, sporadic tumors. Since frequent loss of alleles on chromosome 22 from both sporadic vestibular schwannomas and meningiomas, and from their counterparts in NF2, had been noted previously, the localization of the inherited defect to the same chromosome region suggested that the NF2 locus encodes a recessive tumor suppressor gene (Knudson, A.G. et al., Proc. Natl. Acad. Sci. U.S.A. 68:820-823 (1971)) whose inactivation leads to tumor formation (Seizinger, B.R. et al., Nature 322:644-647 (1986); Seizinger, B.R. et al., Science 236:317-319 (1987); Seizinger, B.R. et al., Proc. Natl. Acad. Sci. U.S.A. 84:5419-5423 (1987)).

[0006] A number of studies of sporadic tumors and tumors from NF2 patients have provided support for this hypothesis (Couturier, J. et al., Cancer Genet. Cytogenet. 45:55-62 (1990); Rouleau, G.A. et al., Am. J. Hum. Genet. 46:323-328 (1990); Fiedler, W. et al., Genomics 10:786-791 (1991); Fontaine, B. et al., Ann. Neurol. 29:183-196 (1991); Fontaine, B. et al., Genomics 10:280-283 (1991); Bijlsma, E.K. et al., Genes Chromosom. Cancer 5:201-205 (1992); Wolff, R.K. et al., Am. J. Hum. Genet. 51:478-485 (1992)). The combined use of family studies and tumor deletion mapping has progressively narrowed the location of NF2 within the q12 band of chromosome 22, and defined a candidate region in which to search for the NF2 genetic defect (Rouleau, G.A. et al., Am. J. Hum. Genet. 46:323-328 (1990); Wolff, R.K. et al., Am. J. Hum. Genet. 51:478-485 (1992)).

Brief Description of the Drawings
[0007] 

Figure 1. Pulsed-field gel analysis of lymphoblast DNA from NF2 patients. DNA in agarose blocks was digested with NotI, subjected to electrophoresis, blotted, and hybridized to a radiolabeled 4 kbp NEFH probe. Lane 1, GUS6274 (affected NF2 unrelated to lanes 3 and 4); lane 2, GUS7870 (normal human); lane 3, GUS5068 (affected daughter); lane 4, GUS5069 (affected mother).

Figure 2. Physical map of chromosome 22 region surrounding the NF2 deletions. NotI sites were determined by pulsed-field gel analysis and confirmed in cosmids. Cosmids were named according to their standardized library coordinates. The enlarged region shows details of the merlin gene vicinity with deleted regions denoted by underlying brackets. Probes used in deletion analysis: A, 14 kbp HindIII fragment from cosmid 96C10; B, 8.5 kbp HindIII fragment from 28H6 that spans the T3 end of 96C10; C, 8 kbp HindIII end-fragment from the T7 end of 96C10; 96T3 (primer pair:

5'CAGATTGTTCATTCCAAGTGG3' [SEQ ID No: 1] and
5'ACCCTGAGGAATCCACTACC3' [SEQ ID No: 2],

product size = 124 bp) and 96T7

(5'TGCACACACATCCTTTTCAC3' [SEQ ID No: 3] and
5'GAGAGAGACTGCTGTCTCAAAAA3' [SEQ ID No: 4],

product size = 92 bp) are sequence-tagged site (STS) assays derived from the T3 and T7 ends of 96C10, respectively. X = XhoI recognition site. Arrows indicate the orientation of transcription and approximate genomic coverage of cDNA JJR-1.

Figures 3A to 3F. Complete nucleotide and predicted amino acid sequence of merlin JJR-1 cDNA clone [SEQ ID No: 15 and 16, respectively]. The common moesin-ezrin-radixin domain spans amino acid residues 1-358. The arrow indicates where poly-A addition has occurred in two independent clones (JJR-6 and JJR-9).

Figure 4. Northern blot analysis of RNA derived from human tumor cell lines. RNA was size-fractionated by electrophoresis in formaldehyde-agarose, blotted and hybridized as described in Experimental Procedures. The blot was hybridized with a radiolabeled probe corresponding to bases 1253-1942 of the JJR-1 cDNA. Lane 1, SK-N-NB neuroblastoma; lane 2, T98G glioblastoma; lane 3, WERI retinoblastoma; lane 4, MCF-7 breast carcinoma; lane 5, HOS osteosarcoma; lane 6, HEPG2 liver carcinoma; lane 7, CATA4 kidney carcinoma; lane 8, SW480 colon carcinoma; lane 9, EJ bladder carcinoma.

Figure 5. Location of trapped exons in JJR-1 cDNA. The blackened box in the cDNA indicates the open reading frame; a thin line indicates both 5' and 3' untranslated regions. Probe names correspond to those used in Figures 5A and B. Probe II (5A) represents bp 818-893 of JJR-1 and is overlapped by another trapped product. These overlapping products may have arisen from partial digestion of the cosmid or alternative splicing in COS7 cells occurring during exon amplification analysis.

Figure 5A (probe I) and Figure 5B (probe II). Southern blot analysis of GUS5069 X hamster hybrid cell lines harboring the altered chromosome 22. DNA samples were digested with BamHI, size-fractionated in agarose gels, blotted and hybridized with radiolabeled probes indicated above each panel. Lane 1, GUS1323 (human control); lane 2, CHTG49 (hamster control); lane 3, GUSH134A3 (hybrid with deleted homolog); lane 4, GUSH134B1 (hybrid with deleted homolog); lane 5, GM10888 (chromosome 22-only hybrid); lane 6, Eye3A6 (hybrid containing chromosome 22 and smaller portions of two to three other chromosomes).

Figure 5C (probe III), Figure 5D (probe IV) and Figure 5E (probe V). Southern blot analysis of NF2 patient lymphoblast DNA. Samples were digested with EcoRI, size-fractionated, blotted and hybridized with radiolabeled probes indicated above each panel. Lane 1, GUS1323 (human control); lane 2, GUS5069 (NF2 affected); lane 3, [...]

[...] in Experimental Procedures. Samples were subjected to electrophoresis in a [...] agarose gel and visualized by staining with ethidium bromide and ultraviolet illumination. The products seen correspond to the [...] sequence encompassed by bases 1258 to 1947 (Primer pair:

5'AGGAGGCTGAACQCACGAG3' [SEQ ID No: 5] and
5'TGGTATTGTGCTTGCTGCTG3' [SEQ ID No: 6]).

Lanes 1, 2, 4, lymphoblasts from independent NF2 patients; lane 3, GUS5068; lane 5, normal control lymphoblast; lane 6, GUS5722.

Figures 6A and 6B. SSCP analysis of RNA samples derived from primary cultures of meningioma tumors from NF2 patients. Lane 1, undenatured PCR product from the tumor in lane 2; lanes 2, 3, 4, and 5, denatured PCR product from independent primary NF2 meningiomas; lane 6, denatured PCR product from GUS5068 lymphoblast. The PCR reactions amplified the following segments: Figure 6A: bp 1709-1947 (Primer pair:

5'CTTCAACCTGATTGGTGACAG3' [SEQ ID No: 7] and
5'TGGTATTGTGCTTGCTGGTG3' [SEQ ID No: 8])

and Figure 6B: bp 457-730 (Primer pair:

5'AGGTACTGGATCATGATGTTTC3' [SEQ ID No: 9] and
5'TTTGGAAGCAATTCCTCTTGG3' [SEQ ID No: 10]).

All lanes contain a proportion of undenatured product, detected by comparison with lane 1.

Figures 7A and 7B. Comparison of sequence identities in merlin-related proteins. The program PILEUP of the GCG package (Devereux, J. et al., Nucleic Acids Res. 12:387-395 (1984)) was used to generate an optimal alignment of the protein products translated from the following GenBank files: Human Moesin (M69066); Human Ezrin (X51521); Mouse Radixin (S66820); Echinococcus multilocularis tegument protein (M61186); human erythrocyte protein 4.1 (M14993). Only amino acid identities are shown; non-identical residues are indicated by "." and gaps introduced by the program are represented by empty spaces.

Figure 8. Pedigree of family with neurofibromatosis type 2.

Only a portion of the large NF2 family described in detail previously (Wertelecki, W. et al., New Engl. J. Med. 319:278-283 (1988)) is presented here schematically. Affected individuals are shown by a filled symbol. The sex for each individual in generations III through VI is not given to protect confidentiality. Diamond symbols indicate the presence of one or more additional offspring in a sibship, with no indication of NF2 state being presented, again to disguise the pedigree and provide confidentiality. The deduced genotypes for selected individuals are shown below the corresponding symbol as "1 1" (homozygous for the normal Asn220 codon) and "1 2" (heterozygous with one normal Asn220 codon and one mutated Tyr220 codon). The arrow indicates the affected individual used in the initial screen to detect the putative mutation.

Figure 9. SSCP analysis of affected and unaffected individuals.

A non-denatured sample (ND) was run to identify the mobility of residual double-stranded PCR product. Bands of abnormal mobility appear in lanes containing DNA derived from affected individuals and are identified by a small arrowhead. Samples represent paired sibs, with the odd-numbered samples being from unaffected individuals and the even-numbered samples deriving from affected individuals.

Figures 10A and 10B. Direct DNA sequence analysis of the alteration.

The DNA sequence surrounding the site of the putative mutation is shown for both a patient (Fig. 10A) and a normal control sample (Fig. 10B) with the corresponding protein sequence aligned by codon. The patient displays both an A and a T residue as the first base of codon 220, indicating the presence of a Tyr codon in addition to the normal Asn codon.

Figure 11. RsaI digestion to confirm the presence of the Asn220/Tyr mutation in affected individuals.

The amplified PCR products containing exon 7 from affected and unaffected individuals were digested [...]. PCR products from affected individuals produce an additional fragment of 67 bp due to digestion of the RsaI site at codons 219-220.

Figure 12. Conserved sequence domain affected by the putative mutation.

The amino acid sequence from residues 200-2*0 of the merlin protein is shown relative to the homologous regions for human moesin, human ezrin and mouse radixin (Trofatter, J. et al., Cell 72:791-800 (1993)). The Asn residue (N) altered to a Tyr by the putative NF2 mutation is starred.

Figures 13A to 13C. Representative SSCP analyses.

For each exon, control blood (B) and tumor (T) DNAs were analyzed and a nondenatured sample (ND) was included. Abnormal migration patterns are observed in each tumor DNA. Two blood DNAs from NF2 patients, S11 and S33, show the same alterations in E2 as the tumor DNAs, suggesting that these shifts represent germline mutations.

Figures 14A and 14B. Direct DNA sequence analysis of two alterations in S29.

The DNA sequences surrounding the sites of two mutations are shown for tumor DNA (T) and control blood DNA (B) from patient S29. The tumor is heterozygous for each of a one base pair deletion in exon 2 and a four base pair deletion in exon 8 (shown in boxes). These deletions are predicted to generate premature stop codons at positions 123 and 251, respectively.

Figures 15A through 15Q. Exon sequences in the merlin gene with intron boundary sequences.

15A: Exon 1.
15B: Exon 2.
15C: Exon 3.
15D: Exon 4.
15E: Exon 5.
15F: Exon 6.
15G: Exon 7.
15H: Exon 8.
15I: Exon 9.
15J: Exon 10.
15K: Exon 11.
15L: Exon 12.
15M: Exon 13.
15N: Exon 14.
15O: Exon 15.
15P: Exon 16.
15Q: Exon 17.

[0008] The vertical line on the left indicates the 5' intron-exon boundary. The vertical line on the right indicates the intron-exon 3' boundary. The designation "N" indicates that the nucleotide is not definitively determined.

Summary of the invention 

[0009] The invention is directed to the protein merlin, mutants thereof, nucleic acid encoding this protein, nucleic acid encoding merlin regulatory regions and exons, mutant nucleic acid sequences, and uses thereof.
[0010] Accordingly, in a first embodiment, the invention is directed to purified preparations of the protein merlin, or mutants thereof.

[0011] In a further embodiment, the invention is directed to an isolated nucleic acid sequence encoding merlin, or mutants thereof.

[0012] In a further embodiment, the invention is directed to a recombinant construct containing nucleic acid encoding merlin, or mutants thereof.

[0013] In a further embodiment, the invention is directed to a vector containing nucleic acid encoding merlin or mutants thereof.
[0016] In a further embodiment, the invention is directed to a method for diagnosing a merlin-associated tumor, such tumor being a tumor characterized by a loss, alteration, or decrease of the activity of the merlin tumor suppressor in the cells of said tumor, and especially by a loss or mutation of the merlin gene in such cells. In one specific embodiment, the mutation is a change A→T in the first position (base) of the codon for the amino acid at position 220 according to Figure 3, especially wherein tyrosine is substituted for asparagine at amino acid position 220. The nucleic acid change can be detected by RsaI. In more general embodiments, the mutations include the genetic sequence alterations in the merlin gene described in Example 6 herein, which contribute to tumor formation and especially NF2.
[0017] In a further embodiment, the invention is directed to a method for treating a merlin-associated tumor in a patient, where the growth of such tumor reflects a functional change in merlin, a decreased level, or lack of merlin tumor suppressor activity in the tumor cell, the method comprising providing a functional merlin gene to the tumor cells of the patient, in a manner that permits the expression of the merlin protein provided by the gene, for a time and in a quantity sufficient to inhibit the growth of the tumor in the patient.

[0018] In a further embodiment, the invention is directed to a method of gene therapy of a symptomatic or presymptomatic patient, the method comprising providing a functional merlin gene to the relevant cells of the patient, whether normal or tumor, in need of the therapy, in a manner that permits the expression of the merlin protein provided by the gene, for a time and in a quantity sufficient to provide the tumor suppressor function of merlin to the cells of the patient.
[0019] In a further embodiment, the invention is directed to a method for diagnosing NF2, the method comprising detecting a mutation in, or loss of, the merlin gene, or merlin protein, in a sample of non-tumor biological material from the subject to be diagnosed. This includes but is not limited to patients and single cells, such as embryonic cells or prenatal cells from amniotic fluid. In a specific embodiment, the detection is of a mutant merlin protein encoded by a mutation of A→T at the first position (base) of the codon for amino acid 220, especially where tyrosine has been substituted for asparagine at amino acid position 220, or of a mutated nucleotide coding sequence involving an A→T transversion at the first base of the codon for amino acid 220, detectable by RsaI digestion. In a more general embodiment, the detection is of a mutant merlin protein encoded by DNA containing mutations including those described in Example 6 herein and which produce an altered merlin protein.
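
The RsaI-based detection described above can be illustrated with a short sketch. It assumes only that RsaI recognizes the sequence GTAC and cuts between GT and AC; the 24-base sequences below are hypothetical, chosen so that an Asn codon (AAC) preceded by a G becomes a Tyr codon (TAC), and are not taken from the merlin sequence of Figure 3. The function names are likewise illustrative.

```python
RSAI_SITE = "GTAC"  # RsaI recognition sequence; cut falls between GT and AC


def rsai_cut_positions(seq):
    """Return 0-based indices at which RsaI would cut the given strand."""
    seq = seq.upper()
    cuts = []
    start = seq.find(RSAI_SITE)
    while start != -1:
        cuts.append(start + 2)  # cut after the "GT" of the site
        start = seq.find(RSAI_SITE, start + 1)
    return cuts


def fragment_lengths(seq):
    """Lengths of the fragments produced by complete RsaI digestion."""
    bounds = [0] + rsai_cut_positions(seq) + [len(seq)]
    return [right - left for left, right in zip(bounds, bounds[1:])]


# Hypothetical sequences (NOT merlin): ...G AAC... versus ...G TAC...
normal = "CCTGATCTGG" + "AAC" + "CTGAAGGATCC"   # Asn codon, no GTAC site
mutant = "CCTGATCTGG" + "TAC" + "CTGAAGGATCC"   # Tyr codon, creates GTAC

print(fragment_lengths(normal))
print(fragment_lengths(mutant))
```

In this toy case the A→T change creates a new recognition site, so the mutant sequence yields one more fragment than the normal sequence, which is the principle behind the additional fragment described for affected individuals.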

[0020] In a further embodiment, the invention is directed to a method for screening an individual for future likelihood of developing merlin-associated tumors, or the disease NF2, the method comprising detecting a mutation in, or loss of, the merlin gene, or merlin protein, in a sample of biological material from the individual. In a specific embodiment, the detection is of a mutant merlin protein encoded by a mutation of A→T at the first position (base) of the codon for amino acid 220, especially where tyrosine has been substituted for asparagine at amino acid position 220, or of a mutated nucleotide coding sequence involving an A→T transversion at the first base of the codon for amino acid 220, detectable by RsaI digestion. In more general embodiments, the detection is of a mutant merlin protein or mutant merlin DNA, wherein the mutation includes but is not limited to any of the mutations described herein that are clinically relevant.
[0021] In a further embodiment, the invention is directed to a method for treating NF2 in a patient, the method comprising providing a functional merlin gene to the desired cells of the patient, in a manner that permits the expression of the merlin protein provided by the gene, for a time and in a quantity sufficient to treat the patient.

[0022] In a preferred embodiment, a method is provided for identifying DNA sequence differences representing potential mutations within amplified coding sequences using single-stranded conformational polymorphism analysis (SSCP). In this method, individual exons from the patient's DNA are first amplified by PCR. The amplification products are then denatured to separate the complementary strands and diluted to allow each single-stranded DNA molecule to assume a secondary structure conformation by folding on itself. Gel electrophoresis under non-denaturing conditions allows the detection of sequence changes compared to the normal (non-mutant) sequence.

[0023] In a further preferred embodiment, mutations discovered by application of the above method are used as standards of comparison for DNA from individuals suspected of being affected or known to be affected, so as to identify the mutation (if any) without full application of the above method. Knowledge of the exact mutation then allows the design of molecular therapeutic vehicles for gene therapy.
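
The idea of using previously discovered mutations as standards of comparison can be sketched as a simple lookup. This is only an illustration under stated assumptions: it works on already-determined sequences rather than on SSCP band patterns, it handles single-base substitutions only (not the deletions also described herein), and the reference sequence, catalogue entry, and function names are all hypothetical.

```python
def substitutions(reference, sample):
    """List single-base differences as (1-based position, ref base, sample base)."""
    if len(reference) != len(sample):
        raise ValueError("equal-length sequences required; indels are not handled")
    pairs = zip(reference.upper(), sample.upper())
    return [(pos, ref, alt) for pos, (ref, alt) in enumerate(pairs, start=1) if ref != alt]


def classify(reference, sample, known_mutations):
    """Pair each difference with its catalogue label, or 'not in catalogue'."""
    return [(variant, known_mutations.get(variant, "not in catalogue"))
            for variant in substitutions(reference, sample)]


# Hypothetical reference fragment and catalogue (NOT merlin sequence data).
REFERENCE = "CCTGATCTGGAACCTGAAGGATCC"
KNOWN = {(11, "A", "T"): "Asn->Tyr (hypothetical catalogue entry)"}

sample_known = "CCTGATCTGGTACCTGAAGGATCC"  # carries the catalogued change
sample_novel = "CCTGATCTGGAACCTGAAGCATCC"  # carries an uncatalogued change

print(classify(REFERENCE, sample_known, KNOWN))
print(classify(REFERENCE, sample_novel, KNOWN))
```

A change that matches a catalogue entry is reported with its label, while any other difference is flagged as not in the catalogue and would presumably call for the full screening method.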

Detailed Description of the Preferred Embodiments 

[0024] The "merlin" gene described herein is a gene found on human chromosome 22 that, as shown herein, contains non-overlapping and interstitial deletions in four independent NF2 patients and a single base change in affected individuals from five sibling pairs within an affected kindred. These results form the basis for concluding that this gene encodes a protein called "merlin," which possesses a tumor suppressor activity. Merlin has previously been termed "the NF2 tumor suppressor." The merlin gene, therefore, includes the DNA sequences shown in Figures 3 and 15A-Q herein and all functional equivalents. The gene usually includes not only the coding sequences, but all regulatory [...] sequences shown in Figures 15A-Q. Additionally, it is understood that the merlin gene includes the corresponding genetic and functional sequences in non-human animal species.
[0025] The merlin gene of the invention encodes a novel protein, merlin, that is related to the moesin-ezrin-radixin family of cytoskeleton-associated proteins (Gould, K.L. et al., EMBO J. 8:4133-4142 (1989); Turunen, O. et al., J. Biol. Chem. 264:16727-16732 (1989); Funayama, N. et al., J. Cell Biol. 115:1039-1048 (1991); Lankes, W.T. et al., Proc. Natl. Acad. Sci. U.S.A. 88:8297-8301 (1991); Sato, N. et al., J. Cell. Sci. 103:131-143 (1992)). This protein, which is herein named "merlin" (moesin-ezrin-radixin-like protein), represents a new class of tumor suppressor whose function is mediated by interactions with the cytoskeleton. Merlin is found on human chromosome 22 between the known markers D22S1 and D22S28.

I. Cloning Of Merlin DNA And Expression Of Merlin Protein

[0026] The identification of merlin cDNA and protein as the mutated or missing gene in tumors from NF2 patients is exemplified below. In addition to utilizing the exemplified methods and results for the identification of additional mutations or deletions of the merlin gene in NF2 patients, and for the isolation of the native human merlin gene, the sequence information presented in Figures 3 and 15A-Q represents nucleic acid and protein sequences that, when inserted into a linear or circular recombinant nucleic acid construct such as a vector, and used to transform a host cell, will provide copies of merlin DNA and merlin protein that are useful sources for the merlin DNA and merlin protein for the methods of the invention. Such methods are known in the art and are briefly outlined below.
[0027] The process for genetically engineering the merlin coding sequence, for expression under a desired promoter, is facilitated through the cloning of genetic sequences which are capable of encoding the merlin protein. These cloning technologies can utilize techniques known in the art for construction of a DNA sequence encoding the merlin protein, such as polymerase chain reaction technologies utilizing the merlin sequence disclosed herein to isolate the merlin gene de novo, or polynucleotide synthesis methods to construct the nucleotide sequence using chemical methods. Expression of the cloned merlin DNA provides merlin protein.

[0028] As used herein, the term "genetic sequences" is intended to refer to a nucleic acid molecule (preferably DNA). Genetic sequences that are capable of being operably linked to DNA encoding merlin protein, so as to provide for its expression and maintenance in a host cell, are obtained from a variety of sources, including commercial sources, genomic DNA, cDNA, synthetic DNA, and combinations thereof. Since the genetic code is universal, it is to be expected that any DNA encoding the merlin amino acid sequence of the invention will be useful to express merlin protein in any host, including prokaryotic (bacterial) and eukaryotic (plants, mammals (especially human), insects, yeast, and especially cultured cell populations).
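
The point that different DNA sequences can encode the same amino acid sequence can be shown with a minimal sketch. It assumes the standard genetic code; the three-residue peptide and both coding sequences are hypothetical examples, not merlin sequence, and `translate` is an illustrative helper.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, listed in TCAG order for each codon position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding strand codon by codon, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Two hypothetical coding sequences that differ at synonymous positions.
seq_a = "ATGAACTACTGA"
seq_b = "ATGAATTATTAA"

print(translate(seq_a), translate(seq_b))
```

Both sequences translate to the same peptide even though they differ at three nucleotide positions, which is why any DNA encoding the disclosed amino acid sequence is expected to be usable for expression.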

[0029] If it is desired to select a gene encoding merlin de novo from a library that is thought to contain a merlin gene, the library can be screened and the desired gene sequence identified by any means which specifically selects for a sequence coding for the merlin gene or expressed merlin protein, such as (a) by hybridization (under stringent conditions for DNA:DNA hybridization) with an appropriate merlin DNA probe(s) containing a sequence specific for the DNA of this protein, the sequence being that provided in Figure 3 or a functional derivative thereof (that is, a shortened form that is of sufficient length to identify a clone containing the merlin gene), or (b) by hybridization-selected translational analysis in which native merlin mRNA which hybridizes to the clone in question is translated in vitro and the translation products are further characterized for the presence of a biological activity of merlin, or (c) by immunoprecipitation of a translated merlin protein product from the host expressing the merlin protein.

[0030] When a human allele does not encode the identical sequence to that of Figures 3 or 15A-Q, it can be isolated and identified as being merlin DNA using the same techniques used herein. Many polymorphic probes useful in the fine localization of genes on chromosome 22 are known and available (see, for example, "ATCC/NIH Repository Catalogue of Human and Mouse DNA Probes and Libraries," fifth edition, 1991, pages 23-24; "Human gene mapping 10: tenth international workshop on human gene mapping," Cytogenet. Cell Genet. 51:1-1148 (1989) and Rouleau, G.A. et al., "A genetic linkage map of the long arm of human chromosome 22," Genomics 4:1-6 (1989)).
[0031] A useful D22S1 probe is clone designation pMS3-18, a BglII RFLP (allele 1: 8.2 kb; allele 2: 3.6 kb), as described in Fontaine, B. et al., Ann. Neurol. 29:183-186 (1991) and Barker et al., Cell 36:131-138 (1984). An equivalent D22S1 probe, pEDF139, is available from the ATCC (ATCC 59688 and ATCC 59689).

[0032] Other useful probes include: D22S28 (clone W23C: ATCC 61636 and ATCC 61637); D22S15; D22S32 (plasmid pEZF31: ATCC 50274 and ATCC 59275); D22S42; D22S46; D22S56; LIF (the leukemia inhibitory factor gene); and NEFH (the neurofilament heavy chain gene).

[0033] Human chromosome 22-specific libraries are known in the art and available from the ATCC for the isolation [...]
[...] is published (see Figure 3 in Lees, J.L. et al., EMBO J. 7:1947-1955 (1988)). Since it is this sequence that provides the specificity for the NEFH gene, it is only necessary that a desired probe contain this sequence, or a portion thereof sufficient to provide a positive indication of the presence of the NEFH gene.

[0035] Merlin genomic DNA may or may not include naturally occurring introns. Moreover, the genomic DNA can be obtained in association with the native merlin 5' promoter region of the gene sequences and/or with the native merlin 3' transcriptional termination region.

[0036] Merlin genomic DNA can also be obtained in association with the genetic sequences which encode the 5' non-translated region of the merlin mRNA and/or with the genetic sequences which encode the merlin 3' non-translated region. To the extent that a host cell can recognize the transcriptional and/or translational regulatory signals associated with the expression of merlin mRNA and protein, then the 5' and/or 3' non-transcribed regions of the native merlin gene, and/or the 5' and/or 3' non-translated regions of the merlin mRNA, can be retained and employed for transcriptional and translational regulation.

[0037] Genomic DNA can be extracted and purified from any host cell, especially a human host cell possessing the long arm of chromosome 22, by means well known in the art. Genomic DNA can be shortened by means known in the art, such as physical shearing or restriction enzyme digestion, to isolate the desired merlin gene from a chromosomal region that otherwise would contain more information than necessary for the utilization of the merlin gene in the hosts of the invention. For example, restriction digestion can be utilized to cleave the full-length sequence at a desired location. Alternatively, or in addition, nucleases that cleave from the 3'-end of a DNA molecule can be used to digest a certain sequence to a shortened form, the desired length then being identified and purified by polymerase chain reaction technologies, gel electrophoresis, and DNA sequencing. Such nucleases include, but are not limited to, Exonuclease III and Bal31. Other nucleases are well known in the art.
[0038] Alternatively, if it is known that a certain host cell population expresses merlin protein, then cDNA techniques known in the art can be utilized to synthesize a cDNA copy of the merlin mRNA present in such population.
[0039] For cloning the genomic or cDNA nucleic acid that encodes the amino acid sequence of the merlin protein into a vector, the DNA preparation can be ligated into an appropriate vector. The DNA sequence encoding merlin protein can be inserted into a DNA vector in accordance with conventional techniques, including blunt-ending or staggered-ending termini for ligation, restriction enzyme digestion to provide appropriate termini, filling in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and ligation with appropriate ligases. Techniques for such manipulations are well known in the art.

[0040] When the merlin DNA coding sequence and an operably linked promoter are introduced into a recipient eukaryotic cell (preferably a human host cell) as a non-replicating, non-integrating molecule, the expression of the encoded merlin protein can occur through the transient (nonstable) expression of the introduced sequence.
[0041] Preferably the coding sequence is introduced on a DNA molecule, such as a closed circular or linear molecule that is capable of autonomous replication. If integration into the host chromosome is desired, it is preferable to use a linear molecule. If stable maintenance of the merlin gene is desired on an extrachromosomal element, then it is preferable to use a circular plasmid form, with the appropriate plasmid element for autonomous replication in the desired host.

[0042] The desired gene construct, providing a gene coding for the merlin protein, and the necessary regulatory elements operably linked thereto, can be introduced into desired host cells by transformation, transfection, or any method capable of providing the construct to the host cell. A marker gene for the detection of a host cell that has accepted the merlin DNA can be on the same vector as the merlin DNA or on a separate construct for co-transformation with the merlin coding sequence construct into the host cell. The nature of the vector will depend on the host organism.

[0043] Suitable selection markers will depend upon the host cell. For example, the marker can provide biocide
resistance, e.g., resistance to antibiotics, or heavy metals, such as copper, or the like.

[0044] Factors of importance in selecting a particular plasmid or viral vector include: the ease with which recipient
cells that contain the vector can be recognized and selected from those recipient cells which do not contain the vector;
the number of copies of the vector which are desired in a particular host; and whether it is desirable to be able to
"shuttle" the vector between host cells of different species.

[0045] When it is desired to use S. cerevisiae as a host for a shuttle vector, preferred S. cerevisiae yeast plasmids
include those containing the 2-micron circle, etc., or their derivatives. Such plasmids are well known in the art and are
commercially available.

[0046] Oligonucleotide probes specific for the merlin sequence can be used to identify clones to merlin and can be
designed de novo from the knowledge of the amino acid sequence of the protein as provided herein in Figure 3 or from
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acid contains expression control sequences which contain transcriptional regulatory information and such sequences 
are "operably linked" to the merlin nucleotide sequence which encodes the merlin polypeptide.

[0048] An operable linkage is a linkage in which a sequence is connected to a regulatory sequence (or sequences)
in such a way as to place expression of the sequence under the influence or control of the regulatory sequence. If the
two DNA sequences are a coding sequence and a promoter region sequence linked to the 5' end of the coding
sequence, they are operably linked if induction of promoter function results in the transcription of mRNA encoding the
desired protein and if the nature of the linkage between the two DNA sequences does not (1) result in the introduction
of a frame-shift mutation, (2) interfere with the ability of the expression regulatory sequences to direct the expression of
the protein or antisense RNA, or (3) interfere with the ability of the DNA template to be transcribed. Thus, a promoter
region is operably linked to a DNA sequence if the promoter is capable of effecting transcription of that DNA sequence.
[0049] The precise nature of the regulatory regions needed for gene expression can vary between species or cell
types, but includes, as necessary, 5' non-transcribing and 5' non-translating (non-coding) sequences involved with
initiation of transcription and translation respectively, such as the TATA box, capping sequence, CAAT sequence, and the
like, with those elements necessary for the promoter sequence being provided by the promoters of the invention. Such
transcriptional control sequences can also include enhancer sequences or upstream activator sequences, as desired.
[0050] The vectors of the invention can further comprise other operably linked regulatory elements such as DNA 
elements which confer antibiotic resistance, or origins of replication for maintenance of the vector in one or more host 
cells. 

[0051] In another embodiment, especially for maintenance of the vectors of the invention in prokaryotic cells, or in
yeast S. cerevisiae cells, the introduced sequence is incorporated into a plasmid or viral vector capable of autonomous
replication in the recipient host. Any of a wide variety of vectors can be employed for this purpose. In Bacillus hosts,
integration of the desired DNA may be necessary.

[0052] Expression of a protein in eukaryotic hosts such as a human cell requires the use of regulatory regions
functional in such hosts. A wide variety of transcriptional and translational regulatory sequences can be employed,
depending upon the nature of the host. Preferably, these regulatory signals are associated in their native state with a
particular gene which is capable of a high level of expression in the specific host cell, such as a specific human tissue
type. In eukaryotes, where transcription is not linked to translation, such control regions may or may not provide an
initiator methionine (AUG) codon, depending on whether the cloned sequence contains such a methionine. Such regions
will, in general, include a promoter region sufficient to direct the initiation of RNA synthesis in the host cell.

[0053] If desired, the non-transcribed and/or non-translated regions 3' to the sequence coding for the merlin protein
can be obtained by the above-described cloning methods. The 3'-non-transcribed region of the native human merlin
gene can be retained for its transcriptional termination regulatory sequence elements, or for those elements which
direct polyadenylation in eukaryotic cells. Where the native expression control sequence signals do not function
satisfactorily in a host cell, sequences functional in the host cell can be substituted.

[0054] It may be desired to construct a fusion product that contains a partial coding sequence (usually at the amino
terminal end) of a first protein or small peptide and a second coding sequence (partial or complete) of the merlin protein
at the carboxy end. The coding sequence of the first protein can, for example, function as a signal sequence for
secretion of the merlin protein from the host cell. Such first protein can also provide for tissue targeting or localization
of the merlin protein if it is to be made in one cell type in a multicellular organism and delivered to another cell type in
the same organism. Such fusion protein sequences can be designed with or without specific protease sites such that a
desired peptide sequence is amenable to subsequent removal.
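
A fusion of this kind only yields merlin protein if the amino-terminal partner and any joining sequence leave the merlin coding sequence in the same reading frame. The following is a minimal sketch of that check, not a method taken from this disclosure; the function name and the short example sequences are hypothetical placeholders.

```python
# Stop codons in the standard genetic code (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def fusion_is_in_frame(first_part, linker=""):
    """Return True if a coding sequence placed directly after
    first_part + linker would be read in frame.

    first_part: hypothetical partial coding sequence of the first
                protein, starting at its initiator ATG.
    linker:     hypothetical joining sequence (for example, bases
                left by a restriction site at the junction).
    """
    upstream = first_part + linker
    # A length that is not a multiple of three shifts the frame.
    if len(upstream) % 3 != 0:
        return False
    # A stop codon upstream would end translation before the junction.
    codons = [upstream[i:i + 3] for i in range(0, len(upstream), 3)]
    return not any(codon in STOP_CODONS for codon in codons)

print(fusion_is_in_frame("ATGAAA", "GGATCC"))  # True
print(fusion_is_in_frame("ATGAAA", "GGATC"))   # False
```

The check covers only frame and premature termination; whether a protease site at the junction is cleavable would have to be assessed separately.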

[0055] The expressed merlin protein can be isolated and purified from the medium of the host in accordance with
conventional conditions, such as extraction, precipitation, chromatography, affinity chromatography, electrophoresis, or
the like. For example, affinity purification with anti-merlin antibody can be used. A protein having the amino acid
sequence shown in Figure 3 can be made, or a shortened peptide of this sequence can be made, and used to raise
antibodies using methods well known in the art. These antibodies can be used to affinity purify or quantitate merlin
protein from any desired source.

[0056] If it is necessary to extract merlin protein from the intracellular regions of the host cells, the host cells can be
collected by centrifugation, or with suitable buffers, lysed, and the protein isolated by column chromatography, for
example, on DEAE-cellulose, phosphocellulose, polyribocytidylic acid-agarose, hydroxyapatite or by electrophoresis or
immunoprecipitation.

[0057] It is to be understood that all of the above procedures that are applicable to cloning and expressing merlin
sequences apply equally to normal and mutant sequences. Mutations may be in any of the regions, i.e., coding, non-
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extended NF2 family originally employed to map the genetic defect on chromosome 22 is exemplified below 
(Wertelecki, W. et al., New Engl. J. Med. 319:278-283 (1988); Narod, S.A. et al., Am. J. Hum. Genet. 51:486-496
(1992)). The characterization of this mutation allows the early detection of the disorder in "at risk" family members
before clinical symptoms appear. Although the exemplary material is directed to the detection of the disorder in a
specific kindred, the exemplary methods can be applied to any family to define the precise molecular lesion underlying NF2,
and this information can then be used to accurately determine whether at-risk members of the family have inherited the
disorder.

[0059] Further, this information can be used to assess any given individual for the merlin genotype in that
individual. Therefore, affected or other (e.g., carrier) individuals can be assayed for the presence of new mutations as
well as for the presence of mutations that are previously defined. Accordingly, the exemplary material shows a preferred
approach to isolating and characterizing the mutations associated with NF2 or other merlin-dependent pathologies.
[0060] New mutations in any desired kindred or affected individual or individual suspected of being affected can be
defined using single-strand conformational polymorphism (SSCP) and sequence analysis of DNA amplified from the
NF2 gene merlin. DNA alterations in the merlin coding sequence cause a shift on SSCP gels that is characteristic of
the disease chromosome being transmitted with the disorder and present in only affected members of the pedigree. For
example, as described below, an A -> T transversion causes substitution of a tyrosine for an asparagine at position 220
of the merlin protein, in a region highly conserved in closely related members of the family of cytoskeletal-associated
proteins. This alteration caused a shift on SSCP gels that was characteristic of the disease chromosome in this NF2
pedigree, being transmitted with the disorder, present only in affected members of the pedigree, absent in unaffected
members of the family and absent from 158 unrelated individuals. Because of this identification, it is now possible to
significantly alter the management of "at risk" members of this extended kindred, based on a precise definition of which
individuals carry the disease gene, and which have escaped inheritance of the defect. A similar approach is applicable
to defining the underlying lesion/DNA defect and thereby improving presymptomatic or prenatal diagnosis in any other
NF2 family that does not have this specific mutation. As shown below in the exemplary material, an array of various
mutations have been associated with the merlin locus. Accordingly, any of these relevant mutations may be used for
purposes of diagnosis in specific kindreds or individuals.

[0061] Accordingly, a combination of amplification, for example using the polymerase chain reaction (PCR), with
SSCP permits the identification of other mutations in a merlin gene as simple as single base changes. Any region of
the merlin gene can be assessed by this procedure. The sequence now made available by the inventors allows the
design of primers and amplification of any desired region, including coding, intronic, and regulatory non-coding or
non-transcribed.

[0062] The SSCP technique uses strategically placed primer sets to amplify small regions of the NF2 gene directly
from genomic DNA using PCR. The double-stranded PCR product containing the amplified region can then be used as
a template to generate single strands by priming multiple rounds of DNA synthesis with one of the oligonucleotides
previously used in a double strand reaction. The amplified products can be used for SSCP analysis, cloned, and
sequenced.

[0063] The amplification products, denatured to separate the complementary DNA strands, are diluted to allow
each single-stranded DNA molecule to assume a secondary structure conformation by folding on itself. The
single-stranded molecules are then subjected to electrophoresis, for example on polyacrylamide, under non-denaturing
conditions. The secondary structure, which is highly dependent on the precise DNA sequence, affects mobility of the strand
on the gel. Even a single base change or deletion can produce a visible shift in the final band position on the gel. Altered
SSCP patterns thus can be analyzed based on a comparison with the SSCP patterns from affected family members,
unrelated individuals, sporadic tumors, et cetera. The altered SSCP fragment pattern can then be correlated with
inheritance of the disorder and associated with affected members of the pedigree.

[0064] Following SSCP analysis, the precise DNA alteration that causes the shifting mobility pattern can be
ascertained by direct DNA sequencing of the PCR amplification product from which the altered single-stranded molecule was
derived.

[0065] After a specific mutation is confirmed following PCR and SSCP analysis and direct sequencing, the
presence of this mutation can be rapidly confirmed for any desired member of the kindred or otherwise by performing the
same type of analysis.

[0066] Alternatively, an approach may be taken exploiting restriction enzyme digestion following PCR amplification
in the area of the defined mutation. If the defined mutation creates a new restriction site, the presence of this mutation
can be rapidly confirmed in other members of the kindred or otherwise by using primers in the area of mutation to
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amplified using primers in the mtron flanking exon 7. SSCP analysis was then performed. The amplified products were 
also cloned and sequenced. Intron DNA sequence flanking each exon was determined. DNA primers that permit
amplification of the exon sequences from genomic DNA using PCR were developed. A mobility shift was detected in the
amplification products of exon 7 comprising base pair 819 to 894 of the merlin coding sequence. DNA sequencing was
used to confirm the precise DNA alteration that caused the shifted mobility pattern in the SSCP analysis.
[0068] Normal merlin DNA displays an AAC codon encoding asparagine at position 220 of the merlin protein. The 
DNA with the shifted mobility reveals both an A and a T residue at the first position of this codon, suggesting a normal 
AAC codon on one chromosome and a mutated TAC on the other. The TAC would substitute a tyrosine at position 220. 
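
The inference drawn from the mixed A/T signal can be written out as a short sketch. The codon table below holds only the two codons named in the text, and the function name and input format are hypothetical choices made for illustration.

```python
# Only the two codons discussed in the text; both assignments follow
# the standard genetic code.
CODON_TABLE = {"AAC": "Asn", "TAC": "Tyr"}

def call_codon_220(first_base_signals):
    """Interpret the bases seen at the first position of codon 220.

    first_base_signals: set of bases observed at that position when the
    PCR product is sequenced directly, e.g. {"A", "T"} for a sample
    carrying one normal and one mutated chromosome.
    Returns the amino acids encoded at position 220, one per allele.
    """
    # The second and third codon positions are A and C on both alleles.
    codons = [base + "AC" for base in sorted(first_base_signals)]
    return [CODON_TABLE[codon] for codon in codons]

print(call_codon_220({"A"}))       # ['Asn']
print(call_codon_220({"A", "T"}))  # ['Asn', 'Tyr']
```

A base other than A or T at this position raises a KeyError, since the table deliberately covers nothing beyond the case described.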
[0069] The sequence change created a GTAC stretch that can be recognized by the restriction endonuclease RsaI.
The site is created nine base pairs 3' to a preexisting RsaI site in this exon. RsaI digests on the amplified PCR product
confirm the presence of the same DNA change in other affected members of the family, its cotransmission with NF2,
and its absence in unaffected members of the pedigree. Unaffected members display two RsaI fragments of 96 base
pairs and 76 base pairs, whereas the affected individuals produce an additional third fragment of 67 base pairs
generated from the RsaI site created by the A -> T change at codon 220.
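
The fragment sizes reported for this digest can be reproduced with a small in-silico digestion. The sketch below assumes a 172 bp amplicon (96 + 76) and uses placeholder filler bases, not merlin sequence; only the positions of the two GTAC sites and the G preceding codon 220 reflect the text. RsaI cuts GT^AC to leave blunt ends.

```python
import re

def build_amplicon(codon_220):
    """Hypothetical 172 bp amplicon. Filler 'C' runs are placeholders."""
    left = "C" * 94 + "GTAC"              # preexisting RsaI site, cut after base 96
    middle = "C" * 5 + "G" + codon_220    # codon 220 is preceded by a G
    seq = left + middle
    return seq + "C" * (172 - len(seq))

def rsa1_fragments(seq):
    """Fragment lengths after complete RsaI digestion (cut between T and A)."""
    cuts = [m.start() + 2 for m in re.finditer("GTAC", seq)]
    edges = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(edges, edges[1:])]

normal = rsa1_fragments(build_amplicon("AAC"))
mutant = rsa1_fragments(build_amplicon("TAC"))
print(normal)  # [96, 76]
print(mutant)  # [96, 9, 67]
```

An affected heterozygote carries both alleles, so its digest shows the 96 and 76 base pair fragments of the normal allele together with the 67 base pair fragment of the mutant allele; the 9 base pair piece is too small to be expected as a visible band.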

[0070] Accordingly, in one embodiment of the invention, this specific mutation could be identified in individuals
having or suspected of having NF2 by simple assays, e.g., Southern blots, involving restriction enzyme digestion with RsaI.
Similarly, if mutations in merlin create or abolish other restriction enzyme sites, this alteration can be exploited to
recognize the mutation in individuals other than the one in whom the mutation was discovered by restriction enzyme
cleavage assays.
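
Whether a given mutation creates or abolishes a restriction site can be checked by comparing site counts in the normal and mutant sequences. The following sketch uses a small table of well-known palindromic recognition sequences; the example sequences are invented for illustration and are not merlin sequence.

```python
# Recognition sequences of a few common enzymes. All are palindromic,
# so scanning one strand is sufficient.
SITES = {
    "RsaI": "GTAC",
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "TaqI": "TCGA",
}

def count_sites(seq, site):
    """Count occurrences of site in seq, allowing overlaps."""
    count, start = 0, seq.find(site)
    while start != -1:
        count += 1
        start = seq.find(site, start + 1)
    return count

def site_changes(normal, mutant):
    """Map each enzyme whose site count differs to 'created' or 'abolished'."""
    changes = {}
    for enzyme, site in SITES.items():
        before, after = count_sites(normal, site), count_sites(mutant, site)
        if after > before:
            changes[enzyme] = "created"
        elif after < before:
            changes[enzyme] = "abolished"
    return changes

# Invented sequences: one A -> T change creates GTAC, another T -> A
# change destroys GAATTC.
print(site_changes("AAGAACTTGAATTCAA", "AAGTACTTGAATACAA"))
# {'RsaI': 'created', 'EcoRI': 'abolished'}
```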

[0071] In specific embodiments of the invention, SSCP analysis was performed to scan all seventeen exons of the
entire merlin gene for mutations. In schwannomas from NF2 patients, base changes, deletions, and insertions were
observed at various locations which resulted in missense, frameshift, and possible splice donor and splice acceptor
alterations. The NF2 gene was also examined in sporadic schwannomas. Deletions, base changes, and insertions were
observed in various locations in both intron and exon sequences. These mutations created frameshift, nonsense, and
missense mutations, as well as actual or presumed alterations in the splice acceptor site, splice donor site, or acceptor
branch site. Accordingly, in specific embodiments of the invention, these mutations may be useful as standards of
comparison for the examination of at-risk individuals.

III. Use Of Merlin For Diagnostic And Treatment Purposes

[0072] It is to be understood that although the following discussion is specifically directed to human patients, the
teachings are also applicable to any animal that expresses merlin and in which loss or mutation of merlin leads to
pathological manifestations as in the human patient.

[0073] It is also to be understood that the methods referred to herein are applicable to any merlin-associated tumor
or subject suspected of developing or having such a tumor, whether such tumor is sporadic or associated with a
condition such as NF2.

[0074] A "merlin-associated" tumor is a tumor characterized in that the growth of such tumor reflects a decrease,
functional alteration, or lack of merlin activity, especially if the decrease, change, or lack of merlin activity reflects a
mutation or loss of the merlin gene or its regulatory regions. An example of such a tumor would include, but is not
limited to, a schwannoma (such as, for example, a vestibular schwannoma, and especially a bilateral vestibular
schwannoma, a schwannoma of a cranial nerve, especially on the vestibular branch of the eighth cranial nerve, or a
schwannoma of a spinal nerve root), or a meningioma (such as, for example, a meningioma of a cranial nerve, a
vestibular meningioma, or a meningioma of a spinal nerve root).

[0075] The diagnostic and screening methods of the invention are especially useful for a person suspected of being
at risk for developing a merlin-associated tumor or disease and/or NF2 based on family history, or a person in which it is
desired to diagnose or eliminate the presence of a merlin-associated condition or tumor or the NF2 condition as a
causative agent behind a tumor growth.

[0076] By "predisposition to develop a merlin-associated tumor" is intended a genotype wherein a subject has the
tendency to develop a genotype leading to the expression of an aberrant merlin gene. This could involve a subject
heterozygous for a merlin mutation who subsequently becomes homozygous for the mutation such that the mutation is
now expressed. Such a predisposition can be detected by molecular assays capable of detecting mutations or
molecular changes in DNA, RNA, or merlin protein. Thus, even though a subject may be heterozygous for the merlin
mutation, the mutation, if expressed on the affected chromosome, could then be detected, or, alternatively, the mutation in
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1) Bilateral eighth nerve masses seen with appropriate imaging techniques (for example, computerized tomo- 
graphic or magnetic resonance imaging), 

2) A first-degree relative with NF2 and either unilateral eighth nerve mass or two of the following: neurofibroma,
meningioma, glioma, schwannoma, and juvenile posterior subcapsular lenticular opacity.
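
The two criteria can be expressed as a simple predicate. The sketch below assumes that satisfying either criterion is sufficient; the function and argument names are hypothetical, and the code is an illustration of the logic rather than a clinical tool.

```python
# Findings listed under criterion 2.
OTHER_FEATURES = {
    "neurofibroma",
    "meningioma",
    "glioma",
    "schwannoma",
    "juvenile posterior subcapsular lenticular opacity",
}

def meets_nf2_criteria(bilateral_eighth_nerve_masses,
                       first_degree_relative_with_nf2,
                       unilateral_eighth_nerve_mass,
                       features=()):
    """Return True if either listed criterion is satisfied.

    Assumption: criteria 1 and 2 are alternatives.
    """
    # Criterion 1: bilateral eighth nerve masses on imaging.
    if bilateral_eighth_nerve_masses:
        return True
    # Criterion 2: affected first-degree relative plus either a
    # unilateral eighth nerve mass or two of the listed findings.
    if first_degree_relative_with_nf2:
        listed = OTHER_FEATURES & set(features)
        return unilateral_eighth_nerve_mass or len(listed) >= 2
    return False

print(meets_nf2_criteria(False, True, False, ["meningioma", "glioma"]))  # True
print(meets_nf2_criteria(False, False, True, ["meningioma", "glioma"]))  # False
```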

[0078] According to the invention, presymptomatic screening of an individual in need of such screening is now
possible using DNA encoding the merlin protein of the invention, and specifically, DNA having the sequence of the native
human merlin gene. The screening method of the invention allows a presymptomatic diagnosis, including prenatal
diagnosis, of the presence of a missing or aberrant merlin gene in individuals, and thus an opinion concerning the likelihood
that such individual would develop or has developed merlin-associated tumors and/or NF2. This is especially valuable
for the identification of carriers of altered or missing merlin genes, for example, from individuals with a family history of
merlin-associated tumors and/or NF2. This is also especially valuable for those patients where the chances of hearing
preservation are optimal with early microsurgical removal of a vestibular schwannoma. Early diagnosis is also desired
to maximize appropriate timely intervention as to any expected sequelae of the patient's tumor growth and lens
opacities.

[0079] For example, in the method of screening, a tissue sample would be taken from such individual, and
screened for (1) the presence of the 'normal' human merlin gene, (2) the presence of merlin mRNA and/or (3) the
presence of merlin protein. The normal human gene can be characterized based upon, for example, detection of restriction
digestion patterns in 'normal' versus the patient's DNA, including RFLP analysis, using DNA probes prepared against
the merlin sequence (or a functional fragment thereof) taught in the invention. Similarly, merlin mRNA can be
characterized and compared to normal merlin mRNA (a) levels and/or (b) size as found in a human population not at risk of
developing merlin-associated tumors and/or NF2 using similar probes. Lastly, merlin protein can be (a) detected and/or
(b) quantitated using a biological assay for merlin activity (its ability to suppress tumor growth) or using an
immunological assay and anti-merlin antibodies. When assaying merlin protein, the immunological assay is preferred for its speed.

[0080] An (1) aberrant merlin DNA size, pattern, or sequence, and/or (2) aberrant merlin mRNA size, level, or
sequence, and/or (3) aberrant merlin protein or level thereof would indicate that the patient is at risk for developing a
merlin-associated tumor and/or NF2 and is likely to develop a merlin-associated tumor and/or NF2.
[0081] Similarly, if the tissue sample was derived from a tumor taken from a patient suspected of having a
merlin-associated tumor and/or NF2, then (1) aberrant merlin DNA size, pattern, or sequence, and/or (2) aberrant merlin
mRNA size, sequence, or level and/or (3) aberrant merlin protein or levels thereof would indicate that the patient has
developed a merlin-associated tumor and/or NF2. These tumors can be treated with the methods of the invention as
described below.

[0082] In accordance with the inventors' characterization of a specific merlin mutation in a particular kindred, and
the guidance provided to extend the knowledge and approaches disclosed herein to the identification of other mutations
and the identification of the mutation disclosed herein in other kindreds, preferred methods of screening tissue samples
from presymptomatic, asymptomatic, or symptomatic individuals involve, rather than comparison with normal merlin
protein, DNA, or RNA, direct detection of abnormal merlin genes and gene products.

[0083] Accordingly, a preferred strategy for identifying DNA sequences representing potential mutations within
amplified coding sequences is the use of SSCP analysis combined with amplification and DNA sequencing.
Accordingly, screening individual exons from a subject's DNA that have been amplified by PCR is a first approach. SSCP
followed by direct DNA sequencing is then performed. In more preferred embodiments, mutations previously identified
using this protocol provide a standard for comparison of the tissue sample to be assayed. The sample is thus amplified
using primers known to be adjacent to the mutation and the amplification product either subjected to SSCP or subjected
to restriction enzyme analysis in the case wherein the mutation creates or abolishes restriction sites found in the normal
merlin gene.

[0084] Similarly, mutant mRNAs may also be the basis for assay of a subject's RNA in Northern blots where the
abnormal RNA has a characteristic pattern as in electrophoresis. Alternatively, cDNA transcripts from the RNA of a
subject can be analyzed by comparing such transcripts to the transcripts from known mutant merlin genes.
[0085] Alternatively, aberrant merlin proteins with characterized mutations may serve as the basis for comparison
with proteins derived from the individual undergoing the diagnostic treatment. Thus, recognition by monoclonal or
polyclonal antibodies using standard immunological assays may reveal the presence of mutated proteins that can be
identified by comparison with previous mutations. Alternatively, methods of identifying mutant proteins using known mutants
as comparisons include, but are not limited to, tryptic peptide digests.

[0086] Accordingly, a repository of mutant DNA, RNA/cDNA, and protein patterns gathered from an analysis of the
sequence be used as a probe. Rather, it is only necessary to use a fragment or length of nucleic acid that is sufficient
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to detect the presence of the merlin gene in a DNA preparation from a normal or affected individual, the absence of 
such gene, or an altered physical property of such gene (such as a change in electrophoretic migration pattern). 
[0088] Prenatal diagnosis can be performed when desired, using any known method to obtain fetal cells, including
amniocentesis, chorionic villus sampling, and fetoscopy. Prenatal chromosome analysis can be used to determine if
the portion of chromosome 22 possessing the normal merlin gene is present in a heterozygous state.

[0089] The merlin DNA can be synthesized, and, if desired, labeled with a radioactive or nonradioactive reporter
group, using techniques known in the art (for example, see Eckstein, F., ed., Oligonucleotides and Analogues: A
Practical Approach, IRL Press at Oxford University Press, New York (1992); and Kricka, L.J., ed., Nonisotopic DNA Probe
Techniques, Academic Press, San Diego (1992)).

[0090] Although the method is specifically described for DNA-DNA probes, it is to be understood that RNA
possessing the same sequence information as the DNA of the invention can be used when desired.
[0091] In the method of treating NF2 in a patient in need of such treatment, functional merlin DNA is provided to
the cells of such patient, especially the tumor cells, in a manner and amount that permits the expression of the merlin
protein provided by such gene, for a time and in a quantity sufficient to treat such patient. Many vector systems are
known in the art to provide such delivery to human patients in need of a gene or protein missing from the cell. For
example, retrovirus systems can be used, especially modified retrovirus systems and especially herpes simplex virus
systems, such as those described in U.S. Appl. No. 07/913,977 (filed July 16, 1992); U.S. Appl. No. 07/956,949 (filed
October 6, 1992); U.S. Appl. No. 07/895,364 (filed June 9, 1992); each incorporated herein fully by reference. In
addition, such methods are provided for in, for example, the teachings of Breakefield, X.A. et al., The New Biologist 3:203-218
(1991), Huang, Q. et al., Experimental Neurology 115:303-316 (1992), WO93/03743 and WO90/09441, each
incorporated herein fully by reference.

[0092] Delivery of a DNA sequence encoding a functional merlin protein, such as the amino acid encoding 
sequence of Figures 3 and 15A-15Q, will effectively replace the missing or mutated merlin gene of the invention, and 
inhibit, and/or stop and/or regress tumor growth that arose due to the loss of the merlin tumor suppressor. 
[0093] This method is especially effective in the tumor types such as those classically associated with NF2, and
especially with a schwannoma (such as, for example, a bilateral vestibular schwannoma, a schwannoma of a cranial
nerve, especially on the vestibular branch of the eighth cranial nerve, or a schwannoma of a spinal nerve root), or
meningiomas (such as, for example, a meningioma of a cranial nerve, a vestibular meningioma, or a meningioma of a spinal
nerve root).

[0094] The method of the invention is also useful to treat conditions such as posterior capsular lens opacities,
deafness, balance disorder, paralysis or other neurological problem when such problem is due to the presence of a
merlin-associated tumor or NF2 condition.

[0095] The manner and method of carrying out the present invention can be more fully understood by those of skill
by reference to the following examples, which examples are not intended in any manner to limit the scope of the present
invention or of the claims directed thereto.

Examples 

Experimental Procedure for Examples 1-4 

NF2 Cell Lines 

[0096] Lymphoblast cell lines were established (Anderson, M.A. et al., In Vitro 20:856-858 (1984)) from affected
members of NF2 pedigrees and from their unaffected relatives. Diagnosis of NF2 conformed to the criteria set forward
by the National Institutes of Health Consensus Development Conference on Neurofibromatosis (Mulvihill, J.J. et al.,
Ann. Intern. Med. 113:39-52 (1990)), except for the patient whose meningioma displayed a 4 bp deletion. This patient
had a right vestibular schwannoma, and multiple meningiomas. Although she did not have a history of NF2, she
probably represents a new mutation. Primary meningioma cells were cultured as described (Logan, J.A. et al., Cancer
Genet. Cytogenet. 45:41-47 (1990)) and analyzed after less than five passages.

[0097] Somatic cell hybrids were prepared by fusing GUS5069 lymphoblasts with a Chinese hamster cell line
deficient in HPRT activity (CHTG49; Athwall, R.S. et al., Proc. Natl. Acad. Sci. U.S.A. 74:2943-2947 (1977)) using GIBCO
PEG 4,000. Fused cell lines were selected by their ability to grow in media containing hypoxanthine, aminopterin and
thymidine (HAT). Hybrids were screened for the chromosome 22 homologues using the polymorphic SSR marker,

r OP^; fTrofatter J A #t Vnl H^npf * 455 (1992)) Control hybrids OM1 0888 and Eye3F*fi 'NA10027Wp 
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DNA/RNA Blotting 

[0098] DNA was prepared from cultured cells and DNA blots prepared and hybridized as described (Gusella, J.F.
et al., Proc. Natl. Acad. Sci. U.S.A. 76:5239-5243 (1979); Gusella, J.F. et al., Nature 306:234-238 (1983)). For
pulsed-field gel analysis, agarose DNA plug preparation and electrophoresis were carried out as described (Bucan, M. et al.,
Genomics 6:1-15 (1990)). RNA was prepared and Northern blotting performed as described in Buckler et al. (Buckler,
A.J. et al., Proc. Natl. Acad. Sci. U.S.A. 88:4005-4009 (1991)).

Cosmid Walking 

[0099] The NEFH probe used for blot analysis and to initiate cosmid walking was pJL215, representing a 4.4 kb
KpnI/XbaI genomic fragment containing exon 4 and 3'UTR (Lees, J.F. et al., EMBO J. 7:1947-1955 (1988)). The NEFH
probe pJL215 was obtained from Dr. Greg Elder and Dr. Robert Lazzarini, The Laboratory of Molecular Genetics,
National Institute of Neurological and Communicative Disorders and Stroke, National Institutes of Health, Bethesda,
MD 20892. Cosmid walking was performed in an arrayed cosmid library prepared from DNA of flow-sorted human
chromosome 22 (LL22NC03; Dr. Pieter DeJong, Lawrence Livermore National Laboratory). Cosmid overlaps were
identified by either hybridization of whole cosmid DNA or isolated fragments to filter replicas of the gridded arrays, or by PCR
screening of row and column DNA pools. STSs were developed by direct cosmid sequencing using the T3 or T7
end-primers (McClatchey, A.I. et al., Hum. Mol. Genet. 1:521-527 (1992)).

cDNA Isolation and Characterization

[0100] Human frontal cortex and hippocampus cDNA libraries in lambdaZAPII (Stratagene) were screened using
exon probes isolated and prepared as described by Buckler et al. (Buckler, A.J. et al., Proc. Natl. Acad. Sci. U.S.A.
88:4005-4009 (1991)). cDNA clones and trapped exons were sequenced as described (Sanger, F. et al., Proc. Natl.
Acad. Sci. U.S.A. 74:5463-5467 (1977)). Direct PCR sequencing was performed as described (McClatchey, A.I. et al.,
Cell 68:769-774 (1992)). Screening for variations by SSCP analysis followed the procedure described in Ambrose et
al. (Ambrose, G. et al., Hum. Mol. Genet. 1:697-703 (1992)). RNA was reverse transcribed using an oligo(dT) primer
(BRL reverse transcriptase) to prepare first strand cDNA. Portions of the cDNA were amplified using the following
primer sets:

5'CCAGCCAGCTCCCTATGGATG3' [SEQ ID No: 11] and
5'AGCTGAAATGGAATATCTGAAG3' [SEQ ID No: 12]

to amplify bp 824-2100 and 

5'GCCTTCTCCTCCCTGGCCTG3' [SEQ ID No: 13] and
5'GATGGAGTTCAATTGCGAGATG3' [SEQ ID No: 14] 

to amplify bp 314-1207. These cold PCR products were then reamplified with specific regional primers for SSCP as
described in the legend to Figures 6, 6A and 6B.
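
As a quick check on the primer coordinates above, the expected product sizes can be computed from the stated cDNA
positions. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and the function
names are hypothetical rather than part of the described procedure.

```python
def amplicon_length(first_bp, last_bp):
    """Length of a product spanning first_bp..last_bp (assumed 1-based, inclusive)."""
    return last_bp - first_bp + 1


def overlap(a, b):
    """Shared interval of two (first, last) ranges, or None if they do not overlap."""
    first, last = max(a[0], b[0]), min(a[1], b[1])
    return (first, last) if first <= last else None


set_1 = (824, 2100)  # primers SEQ ID No: 11 and 12
set_2 = (314, 1207)  # primers SEQ ID No: 13 and 14

print(amplicon_length(*set_1))  # product size for the first primer set
print(amplicon_length(*set_2))  # product size for the second primer set
print(overlap(set_1, set_2))    # cDNA positions covered by both products
```

Under these assumptions the two products are roughly 1.3 kb and 0.9 kb long and share cDNA positions 824-1207.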

Example 1 

Scanning the NF2 Candidate Region for Rearrangement 

[0101] The region on chromosome 22 that was examined for the presence of the NF2 gene was between D22S1
and D22S28; this region was estimated to encompass 6 Mb of band q12 (Frazer, K.A. et al., Genomics 14:574-584
(1992)).

[0102] By scanning the human chromosome 22 region between D22S1 and D22S28 for loss of DNA, it was
determined whether some germline NF2 mutations might involve a deletion of the tumor suppressor gene, as has been found
in Wilms' tumor and retinoblastoma (Riccardi, V.M. et al., Pediatrics 61:604-610 (1978); Francke, U. et al., Cytogenet.
Cell Genet. 24:185-192 (1979); Dryja, T.P. et al., Proc. Natl. Acad. Sci. U.S.A. 83:7391-7394 (1986)). Pulsed-field gel

[...] from a female NF2 patient [...] hybridized to apparently altered fragments of reduced [...]
both NotI and NruI.
[0103] In NotI-digested DNA (Figure 1), the NEFH probe detected fragments of approximately 600 kb, 400 kb and
230 kb in most lymphoblast cell lines. It was not possible to confirm that the 600 kb fragment originated from
chromosome 22. Thus, it was possible that hybridization had occurred with a related locus (Menon et al., unpublished results).
Variable intensity of the 230 kb fragment in many samples suggested that it resulted from partial digestion of the 400
kb fragment. In GUS5069, additional fragments of approximately 370 kb and 200 kb were observed. These results are
consistent with the possibility of a deletion within the region common to the 400 kb and 230 kb fragments. The alteration
was transmitted along with NF2 from the patient to her affected daughter (represented by GUS5068) (Figure 1).
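
The reasoning behind the deletion hypothesis can be made concrete with simple arithmetic on the approximate
fragment sizes quoted above. The sketch below is only an illustration: pairing the 370 kb band with the 400 kb band
and the 200 kb band with the 230 kb band is an assumption made here, pulsed-field size estimates are coarse, and the
function name is hypothetical.

```python
def size_shifts(normal_kb, altered_kb):
    """Difference in kb between each normal fragment and its assumed altered counterpart."""
    return [n - a for n, a in zip(normal_kb, altered_kb)]


normal_kb = [400, 230]   # NotI fragments seen in most lymphoblast lines
altered_kb = [370, 200]  # additional fragments seen in GUS5069

shifts = size_shifts(normal_kb, altered_kb)
print(shifts)
# Equal shifts in both fragments are what a single deletion inside their shared region would produce.
print(len(set(shifts)) == 1)
```

Both fragments shrink by a similar amount on these rough estimates, which is the pattern expected if one deletion lies
in the segment the two fragments have in common.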

Example 2 

Chromosome Walking Toward the Merlin Gene 

[0104] To isolate DNA corresponding to the region of chromosome 22 apparently deleted in GUS5069, a
bi-directional cosmid walk was initiated from NEFH. At each step, single restriction fragments of the cosmids were used as
probes on pulsed-field gels to establish the location of the putative deletion relative to NEFH. On the 5' side of NEFH,
a NotI site that was rarely cleaved in lymphoblast DNA was identified. Probes beyond the NotI site detected the same
approximately 400 kb NotI fragment along with a 170 kb fragment of variable intensity. Thus, infrequent cleavage of this
NotI site divides the 400 kb fragment into fragments of 230 kb and 170 kb. Since the putative deletion in GUS5069
affects the 230 kb fragment but not the 170 kb fragment, further experiments continued to walk only 3' of NEFH.
Pulsed-field gel blots containing DNA from GUS5069 were again probed.

[0105] The NotI pulsed-field gel map and a minimal set of clones representing the cosmid walk and the extent of
the genomic deletion (see Figure 5A) are shown in Figure 2. The deletion was reached when a probe (Figure 2, Probe
A) was tested from cosmid 96C10 which failed to detect the altered NotI fragment in GUS5069. However, various
probes from cosmids 28H6 and 121G10 did detect the altered fragment.

[0106] To estimate the extent of deletion, probes B and C (Figure 2) were tested. Probe B is an 8 kb HindIII
fragment from 28H6 which overlaps with the T3 end of 96C10. Probe C is a 9 kb HindIII fragment from the T7 end of 96C10.
Probe B detected both the normal and the altered NotI PFG fragment, but probe C detected only the normal fragment.
[0107] For more precise analysis of the deletion, the altered chromosome 22 from GUS5069 was segregated from
its normal counterpart in human x hamster somatic cell hybrids. STS assays for the T3 and T7 ends of 96C10 were
created and hybrids containing the separated chromosomes 22 were tested. In contrast to the above hybridization
results, the T3 end of 96C10 was absent in hybrid GUSH134A3, containing the deleted chromosome, but present in
GUSH134A10, containing the normal chromosome. Moreover, the T7 assay was positive in both hybrids. The locations
of probes B and C, and of both STS assays, were confirmed on the cosmid walk. Thus, the failure of probe C to detect
the altered fragment suggests that the deletion spans most but not all of this sequence. Similarly, the other deletion
breakpoint must occur within the region spanned by probe B. Therefore, the results of hybridization and PCR indicate
that the deletion must encompass almost all of 96C10 and up to an additional 5 kb of 28H6. This 35-45 kb region is
expanded below the cosmid walk in Figure 2.

Example 3 

Identification and Characterization of the merlin cDNA 

[0108] Exon amplification (Buckler, A.J. et al., Proc. Natl. Acad. Sci. U.S.A. 88:4005-4009 (1991)), which produces
cloned "trapped exons," was applied to cosmids 28H6, 96C10, 121G10, 123F5, 10H11, and 7C4 surrounding the site
of the NF2 deletion as a rapid method of obtaining exonic probes for cDNA cloning. Each exon clone can represent a
single exon, or multiple exons spliced together in the trapping procedure. Twenty-four exon clones were obtained and
sequenced, 6 of which displayed sequence similarity with the cytoskeleton-associated proteins moesin, ezrin and
radixin (see below). The latter exons were used to screen human frontal cortex and hippocampus cDNA libraries.
[0109] Figures 3A to 3F show the complete DNA sequence of JJR-1, the longest clone obtained in cDNA
screening. JJR-1 has been deposited with the American Type Culture Collection and assigned ATCC 75509. This sequence
contains eight of the cloned exon segments as shown in Figure 5. The cDNA is 2,257 bp long and shows no evidence
of a poly(A) tail. However, two shorter cDNA clones, JJR-6 and JJR-9, which overlapped the restriction map of JJR-1,
had apparent poly(A) tails beginning at base 2231. JJR-1 contains an open reading frame of 1785 bp, encoding a
predicted protein of 595 amino acids [...] the putative initiator [...]

[0110] Both the JJR-1 sequence and the predicted protein product were used to search for similarity in nucleic
acid and protein databases using the BLAST network service of the National Center for Biotechnology Information
(Altschul, S.F. et al., J. Mol. Biol. 215:403-410 (1990)). The DNA sequence displayed significant similarity to moesin and
ezrin genes from several species, including man (P=9.0e-125 and 9.0e-122), to mouse radixin ([...]e-102) and to
Echinococcus multilocularis tegument protein (2.4e-21). Striking similarity was also detected at the amino acid level with these
same proteins (2.5e-146, 5.0e-145, 2.7e-145 and 7.6e-73, respectively) and to a potential product of a sequence tag from
Caenorhabditis elegans (3.7e-43). Weaker similarities were detected to the sequences of two protein tyrosine
phosphatases, PTP-MEG and PTP-H1 (1.3e-17 and 9.6e-16, respectively), to erythrocyte protein 4.1 (9.9e-14) and to a wide
range of myosin, tropomyosin and paramyosin proteins. Because this novel gene is most closely related to moesin,
ezrin and radixin (45-47% identity), it is called "merlin".

[0111] Northern blot analysis using total RNA from various cultured human tumor cell lines (Figure 4) revealed two
major hybridizing species of 2.6 kb and 7 kb, and a less intensely hybridizing RNA of 4.4 kb. A similar pattern was
detected in poly(A)+ RNA from various human tissues, including heart, brain, lung, skeletal muscle, kidney, pancreas
and weakly in liver, indicating that the merlin gene is expressed widely. The apparent poly(A) tails detected in JJR-6 and
JJR-9 suggest that these clones may have derived from the approximately 2.6 kb RNA. The JJR-1 clone likely derived
from one of the larger RNAs which apparently has a much longer 3' UTR. However, it cannot be excluded that the larger
RNAs arise by alternative splicing that alters the length and composition of the coding sequence or by hybridization to
related family members.

Example 4 

Non-Overlapping Deletions Interrupt the Candidate NF2 Gene

[0112] To determine whether the deletion detected in GUS5069 interrupts the merlin gene, exon probes were
prepared from across the coding sequence (Figure 5) and Southern blots containing DNA from GUSH134A3 and
GUSH134B1 (two independent hybrid lines containing the deleted chromosome 22) were analyzed. The results for
probes I and II, shown in Figure 5A and Figure 5B respectively, demonstrate that the probe I sequence was absent from
both hybrids, while the probe II sequence was present in both. Thus, the genomic deletion truncates the merlin gene
within the coding sequence between probes I and II, removing the 5' end.

[0113] In a search for additional alterations in the merlin gene, blots of restriction-digested DNA from 33 unrelated
NF2 patients were scanned using the cDNA as probe. One patient, represented by cell line GUS5722, displayed altered
fragments with several restriction enzymes suggestive of a small ~3.4 kb genomic deletion. This patient was analyzed
using Southern blotting as shown in Figures 5C to 5E. Probes III, IV and V all reside on the same 21 kb EcoRI fragment.
In GUS5722, probes III and V detected both the normal EcoRI fragment and a second fragment reduced in size by
deletion. Probe IV failed to detect the altered fragment in GUS5722 because it lies within the region deleted. PCR
amplification of first strand cDNA from GUS5722 was performed and confirmed the presence of two types of PCR product
(Figure 6). Direct sequencing revealed that the novel PCR product was missing bases 1559 to 1792 of the cDNA,
representing deletion of at least two exons. The absence of this segment would remove 78 amino acids from the protein,
while leaving the reading frame intact. The GUS5722 cell line was generated from a member of a large NF2 kindred
(Family 3 in Narod, S.A. et al., Am. J. Hum. Genet. 51:486-496 (1992)), and the deletion was present in five affected
members and absent in eleven unaffected members of this pedigree.

[0114] The presence of non-overlapping deletions affecting the merlin gene in two independent families supported
the conclusion that this gene represents the NF2 tumor suppressor. The presence of additional alterations was
determined by single-strand conformational polymorphism (SSCP) analysis of PCR amplified first strand cDNA from tumor
and lymphoblast samples. mRNA from four primary cultures of meningiomas (3 from NF2 patients with a family history
of the disorder, 1 from a probable new mutation to NF2) was used and only selected regions of the mRNA were
analyzed. Two of the tumors yielded aberrant patterns.

[0115] A meningioma from a female patient likely to have NF2 (see Experimental Procedures) displayed a reduced
size for the expected non-denatured PCR product on SSCP gels (Figure 6A). The vastly reduced level of the
normal-sized PCR product suggests that this tumor had lost alleles in this region of chromosome 22. However, lymphocyte
DNA was not available from this patient to confirm this. Direct sequence analysis of the PCR product confirmed the
presence of a 4 bp deletion which removes bases 1781 to 1784. This deletion alters the reading frame and generates
a shorter protein.
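
The reading-frame consequences of the two cDNA deletions described in this Example follow directly from their
lengths. The sketch below is an illustration only, assuming the quoted base positions are 1-based and inclusive; the
function name is hypothetical.

```python
def describe_deletion(first_base, last_base):
    """Return (length in bp, whether the frame is preserved, codons removed if in frame)."""
    length = last_base - first_base + 1
    in_frame = length % 3 == 0
    codons_removed = length // 3 if in_frame else None
    return length, in_frame, codons_removed


# Deletion seen in cDNA from GUS5722 (bases 1559 to 1792)
print(describe_deletion(1559, 1792))

# Deletion seen in the meningioma of paragraph [0115] (bases 1781 to 1784)
print(describe_deletion(1781, 1784))
```

A deletion whose length is a multiple of three removes whole codons and leaves the frame intact, which matches the 78
amino acids stated for GUS5722; a 4 bp deletion cannot do so and therefore shifts the frame.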

[0116] A meningioma from a male patient with NF2 (see Experimental Procedures) displayed an altered pattern on
SSCP analysis (Figure 6B). This meningioma was known to have lost heterozygosity on chromosome 22 based on [...]

Discussion

[0117] The delineation of non-overlapping deletions affecting different portions of the same chromosome 22 gene
in two independent NF2 families is strong evidence that "merlin" is the NF2 tumor suppressor. Although it is possible
that one or both of these deletions may affect a second gene in the area, should this gene in fact be the NF2 tumor
suppressor, it would have to be affected by both deletions and must therefore be composed of exons interspersed with
those of the merlin gene.

[0118] The larger of the deletions truncates the 5' end of the merlin gene, removing at least 120 amino acids. In
addition, the extent of this deletion suggests that the 5' regulatory elements may also be missing. The smaller germline
deletion removes 78 amino acids from the C-terminal portion of the protein. It is likely that such alterations would have
drastic consequences for the function of the merlin protein.

[0119] The four base pair and single base pair deletions in meningiomas from unrelated NF2 patients could
possibly be of somatic origin and unrelated to the inherited predisposition; however, the almost exclusive expression of the
altered copy of the merlin gene suggests that the normal sequence has been lost as a somatic event in tumor formation.
This is consistent with the tumor suppressor model, and would suggest that the frameshift alterations actually represent
germline mutations in these patients.

[0120] The merlin protein encoded at the candidate NF2 locus is a novel member of a growing family of proteins
that have been proposed to act as links between the cell membrane and the cytoskeleton (Luna, E.J. et al., Science
258:955-964 (1992); Sato, N. et al., J. Cell Sci. 103:131-143 (1992)). All members of the family (which includes
moesin, ezrin, radixin, erythrocyte protein 4.1 and talin) contain a homologous domain of approximately 200 amino
acids near the N-terminus followed by a segment that is predicted to be rich in α-helical structure, and a highly charged
C-terminal domain. Where they have been characterized from more than one mammalian species, members of this
family are remarkably conserved. Moreover, highly related genes have been detected in the nematode, Caenorhabditis
elegans (Waterston, R. et al., Nature Genet. 1:114-123 (1992)), and in the parasitic cestode, Echinococcus
multilocularis (Frosch, P.M. et al., Mol. Biochem. Parasitol. 48:121-130).

[0121] Although most distantly related to merlin, protein 4.1 and talin are the best studied members of this family
of proteins and have contributed the most towards understanding the function of the gene family. Protein 4.1 plays a
critical role in maintaining membrane stability and cell shape in the erythrocyte by connecting the integral membrane
proteins glycophorin and protein 3 (the anion channel) to the spectrin-actin lattice of the cytoskeleton (Leto, T.L. et al.,
J. Biol. Chem. 259:4603-4608 (1984); Conboy, J. et al., Proc. Natl. Acad. Sci. U.S.A. 83:9512-9516 (1986)). Genetic
defects in protein 4.1 lead to one form of hereditary elliptocytosis (Tchernia, G. et al., J. Clin. Invest. 68:454-460 (1981);
Delaunay, J. et al., Nucleic Acids Res. 12:387-395 (1984)). The binding site for glycophorin in protein 4.1 has been
mapped to the N-terminal domain, suggesting that the homologous region in other family members might also bind to
proteins in the membrane (Leto, T.L. et al., in Membrane Skeletons and Cytoskeletal Membrane Associations, Bennett
et al., eds., Liss, New York (1986), pp. 201-209). Interestingly, a related domain is also found in two protein tyrosine
phosphatases, PTP-MEG and PTP-H1, perhaps allowing these enzymes to associate with the membrane or the
cytoskeleton (Gu, M. et al., Proc. Natl. Acad. Sci. U.S.A. 88:5867-5871 (1991); Yang, Q. et al., Proc. Natl. Acad. Sci.
U.S.A. 88:5949-5953 (1991)). Binding of protein 4.1 to spectrin is mediated by the α-helical region of the protein,
suggesting that the analogous segments of the other family members might also bind to cytoskeletal components (Correas,
I. et al., J. Biol. Chem. 261:3310-3315 (1986)). Talin, a large protein found in regions of focal adhesions at cell-cell or
cell-substrate contacts, appears to behave similarly, binding to the integrins in the cell membrane and to vinculin,
thereby connecting the extracellular adhesion matrix to the cytoskeleton (Rees, D.J.G. et al., Nature 347:685-689
(1990); Luna, E.J. et al., Science 258:955-964 (1992)).

[0122] Moesin, ezrin and radixin are highly related proteins (~70-75% amino acid identity) that have each been
postulated to provide a link between the cytoskeleton and the cell membrane. Each of these proteins shares 45-47%
amino acid identity with merlin. Moesin (membrane-organizing extension spike protein), originally proposed as a
receptor for heparin sulfate, has been found at or near the membrane in filopodia and other cell surface protrusions (Lankes,
W.T. et al., Proc. Natl. Acad. Sci. U.S.A. 88:8297-8301 (1991); Furthmayr, H. et al., Kidney Int. 41:665-670 (1992)).
Ezrin (cytovillin) has been seen in association with microvilli and cellular protrusions in many cell types (Pakkanen, R.
et al., J. Cell. Biochem. 38:65-75 (1988); Gould, K.L. et al., EMBO J. 8:4133-4142 (1989); Turunen, O. et al., J. Biol.
Chem. 264:16727-16732 (1989); Hanzel, D. et al., EMBO J. 10:2363-2373 (1991); Birgbauer, E. et al., J. Neurosci. Res.
30:232-241 (1989)). Rapid redistribution of ezrin to regions of membrane remodeling, such as microvillar formation and
membrane ruffling in response to growth factor stimulation, may be regulated by phosphorylation of the protein on both

[...] 1039-1048 (199[...])). Interestingly, in mitotic cells radixin is concentrated at the cleavage furrow [...]
[0123] Merlin possesses an N-terminal domain that is similar to protein 4.1 (28% identity), and to talin (21%
identity). It is much more closely related, however, to moesin, ezrin and radixin (Figure 7). Amino acid identity between
merlin and the three latter proteins is concentrated in the first 342 residues (~63% identity). Like these other family
members, the merlin protein is predicted to have a very long α-helical domain spanning 160-170 amino acids,
beginning around residue 300. The first third of this domain overlaps with the region of strongest homology to moesin, ezrin
and radixin. However, the remaining stretch shows limited similarity with these proteins and with a wide variety of
myosins and tropomyosins. The C-terminal region of merlin contains a hydrophilic domain analogous to those of other
family members. The similarity in structure of merlin to the other members of this family suggests that it too may
normally act as a link between the cytoskeleton and the cell membrane and may thus represent a new class of tumor
suppressor gene.

[0124] The cytoskeleton of mammalian cells is a complicated lattice-work of many different kinds of interconnected
filaments (Luna, E.J. et al., Science 258:955-964 (1992)). It participates in a wide range of crucial cellular activities,
including determining and altering shape, movement, cell division, cell-cell communication, cell anchorage, and
organization of the intracellular milieu (Bernal, S.D. et al., Crit. Rev. Oncol. Hematol. 3:191-204 (1985)). A defect in a protein
which connects some component of this network to the plasma membrane could affect any of these processes, and
have a consequent effect on growth control. For example, inactivation of the merlin protein may dysregulate growth by
disrupting a signal transduction pathway, by altering anchorage dependence, or by upsetting cell cycle regulation;
whether it acts by one of these or by some other mechanism remains to be determined. However, the characteristic
structure of the merlin protein suggests that a search for its membrane and cytoskeletal binding targets might provide a
logical route to exploring this question.

Example 5 

Altered Coding Sequence of the Merlin Tumor Suppressor Permitting DNA Diagnosis in an Extended Pedigree 
with NF2 

[0125] The objective of this example was to define the DNA mutation causing NF2 in a large, well-studied NF2 ped- 
igree previously used to chromosomally map and to isolate the disease gene. The design was to use SSCP and 
sequence analysis of DNA amplified from the NF2 gene of affected and unaffected persons. The participants in the 
study set forth in this example were affected, unaffected, and at-risk members of a large pedigree segregating NF2. The
results of the study showed a DNA alteration in the merlin coding sequence causing a shift on SSCP gels that was char- 
acteristic of the disease chromosome in this NF2 pedigree, being transmitted with the disorder, present only in affected 
members of the pedigree, absent in unaffected members of the family, and absent from 158 unrelated individuals. The 
alteration caused substitution of a tyrosine for an asparagine at position 220 of the merlin protein, in a region highly con- 
served in closely-related members of the family of cytoskeletal-associated proteins. The DNA change could also be 
detected by restriction enzyme digestion with RsaI.

A. Materials and Methods 

1. Patients 

[0126] The family studied in this Example has been extensively characterized clinically and described in detail
previously (Wertelecki, W., et al., New Engl. J. Med. 319:278-283 (1988)). Medical records, histologic slides, death
certificates and autopsy reports were sought for all symptomatic family members. Clinical assessments were performed,
including a search for signs of neurofibromatosis type 1. The results of computed tomography, MRI, and ophthalmologic
and audiologic examinations were also sought. Diagnostic criteria used were those of the NIH consensus statement on
neurofibromatosis (National Institutes of Health Consensus Development Conference Statement: Neurofibromatosis,
Arch. Neurol. 45:575-578 (1988)). Lymphoblast lines were established from peripheral blood samples for all patients
and relatives as previously described (Anderson et al., In Vitro 20:856-858 (1984)). DNA was isolated from peripheral
or cultured leukocytes as described herein.

2. Polymerase Chain Reaction (PCR) Amplification of Exon 7 

[0127] [...] dTTP, [...] 4 pmoles of each primer, [...] polymerase, [...]
and 0.1 mg/ml gelatin in a total volume of 10 µl. Each reaction was cycled 35 times using the following steps:
denaturation (at 95°C for 15 minutes), primer annealing (at 60°C for 1.5 minutes), and elongation (at 72°C for 30 seconds).

3. Single Strand Conformational Polymorphism Analysis (SSCP) 

[0128] SSCP analysis was performed according to the procedure of Orita et al. with minor modifications (Orita, M.,
et al., Genomics 5:874-879 (1989)). PCR amplification was carried out as described above except each reaction
included 0.1 µl (10 mCi/ml) of α-32P dATP (Amersham Life Sciences). The amplified products were diluted 1:20 in
0.05% SDS, 6 mM EDTA, 40% formamide, 0.5 mg/ml xylene cyanol and 0.5 mg/ml bromphenol blue and heated to
90°C for 3 minutes to denature the DNA. Samples were immediately cooled on ice and loaded on an 8% polyacrylamide
gel containing 8% glycerol. Electrophoresis was carried out at room temperature for 12 to 16 hours at a constant power
of 6-10 W. Gels were dried and exposed overnight to Kodak X-Omat AR film.

4. DNA Sequencing 

[0129] The double-stranded PCR product containing the amplified exon was used as a template to generate
single strands by priming multiple rounds of DNA synthesis with one of the oligonucleotides previously used in the
double-strand reaction (Gibbs, A., et al., Proc. Natl. Acad. Sci. U.S.A. 86:1919-1923 (1989)). Conditions for the single
strand-producing reactions were identical to the PCR amplification of individual exons as described above. After 30 cycles, the
product was ethanol precipitated in the presence of ammonium acetate and resuspended in 7 µl water for subsequent
sequence analysis. The sequencing reactions were performed as described in Park et al. (Hum. Mut. 1:293-297 (1992)).

[...] obtained from the individual clones were sequenced using the Sequenase kit (US Biochemicals).

B. Results

1. The Extended NF2 Pedigree

[0130] The NF2 gene shown in previous Examples was mapped precisely using genetic linkage analysis in an
extremely large disease pedigree from which 137 blood samples were tested for polymorphic DNA markers on
chromosome 22 (Wertelecki, W., et al., New Engl. J. Med. 319:278-283 (1988)). For this Example, a subset of the members of
this kindred was selected to use for identification of the underlying NF2 mutation. The relationships of the family
members used are shown in Figure 8.

2. Scanning for NF2 Mutation Affecting Expression of Merlin 


[0131] The NF2 gene has been identified herein based on non-overlapping genomic DNA deletions that altered its
coding sequence in three independent NF2 families and in a meningioma from an unrelated NF2 patient. The merlin
mRNA sequence consists of more than 2250 bases. The translated portion of the mRNA, an open reading frame of
1785 bases encoding the predicted 595 amino acid merlin protein, is bracketed by 5' and 3' untranslated regions whose
extent and variability remain to be defined completely. The merlin mRNA is spliced together from at least 17 exons that
are distributed across about 100 kb of chromosome 22. In order to scan for mutations in the NF2 gene, the intron DNA
sequence flanking each exon was determined, and DNA primers that permit amplification of the exon sequences from
genomic DNA using the PCR have been developed.

[0132] The strategy for identifying DNA sequence differences representing potential mutations within the amplified
coding sequences is the use of SSCP analysis (Orita, M., et al., Genomics 5:874-879 (1989)). In this method, individual
exons from the patient's DNA are first amplified by PCR. The amplification products are then denatured to separate the
complementary DNA strands and diluted to allow each single stranded DNA molecule to assume a secondary structure
conformation by folding on itself. The single stranded DNA molecules are then subjected to polyacrylamide gel
electrophoresis under nondenaturing conditions. The secondary structure, which is highly dependent on the precise DNA
sequence, affects mobility of the strand on the gel. Even a single base change or deletion can produce a visible shift in
the final band position on the gel.

[0133] The SSCP technique was applied to the DNA samples from the family in Figure 8. A mobility shift was
detected when comparing amplified PCR products from the affected family member, designated by an arrow in Figure
8, relative to those from several normal unrelated individuals and from members of other unrelated NF2 families. This
[...] members of the NF2 pedigree were typed. Figure 8 shows the deduced genotype of each family member tested in
this and other analyses. Figure 9 displays the results of one SSCP analysis in which five sibling pairs, each consisting of
one affected and one unaffected individual, were tested. The altered SSCP pattern shows a clear correlation with
inheritance of the disorder, and is present only in affected members of the pedigree. This SSCP pattern was not observed
in a search of more than 300 independent chromosomes, derived from normal controls, other NF2 families, and
sporadic tumors of various types (colon, astrocytoma, schwannoma etc.).


3. Identification of the DNA Sequence Alteration Underlying the SSCP Variation 

[0134] To determine the precise DNA alteration that caused the shifted mobility pattern in the SSCP analysis, direct
DNA sequencing of the PCR amplification product from an affected member of the pedigree and an unrelated normal
control was performed. The change in DNA sequence in the affected individual and its absence in an unaffected
member of the pedigree was reconfirmed using cloned PCR product. The results of sequence analysis are shown in Figures
10A and 10B.

[0135] The control sequence displays an AAC codon encoding asparagine at position 220 of the merlin protein.
The DNA from the patient reveals both an A and a T residue at the first position of this codon, suggesting a normal AAC
codon on one chromosome, and a mutated TAC on the other. The TAC codon would substitute a tyrosine at position
220. Since this SSCP variation was absent from more than 300 independent chromosomes, it is unlikely that this
change is simply a polymorphism. Rather, the change from the aliphatic side chain of asparagine to the bulky aromatic
side chain of the tyrosine is likely to have significant consequences for the structure of the protein.
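
The codon change described above can be checked against the standard genetic code. The sketch below is a minimal
illustration, not part of the described method; the table is built from the conventional TCAG ordering, and the
function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# One-letter amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate_codon(codon):
    """Translate a single DNA codon using the standard genetic code."""
    return CODON_TABLE[codon.upper()]


normal = "AAC"
mutant = "T" + normal[1:]  # A to T change at the first position of the codon

print(normal, translate_codon(normal))  # N = asparagine
print(mutant, translate_codon(mutant))  # Y = tyrosine
```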

4. Confirmation of the Sequence Alteration by RsaI Digestion

[0136] The sequence change outlined above creates a GTAC stretch that is recognized and cleaved by the
restriction endonuclease RsaI. This site is created 9 bp 3' to a pre-existing RsaI site in this exon. Thus, to confirm the
presence of the same DNA change in other affected members of the family, its cotransmission with NF2, and its absence in
unaffected members of the pedigree, we performed RsaI digests on the amplified PCR product from several family
members. Typical results are shown in Figure 11. The unaffected members of the family (U) and other control
individuals display two RsaI fragments of 96 bp and 76 bp, whereas the affected individuals (A) produce an additional third
fragment of 67 bp generated from the RsaI site created by the A→T change at codon 220.
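
The fragment sizes quoted above are consistent with a new RsaI site 9 bp downstream of the existing one. The sketch
below illustrates this with a toy digest. The 172 bp sequence is entirely hypothetical filler built to place the sites at
the stated distances (it is not the real exon sequence), and it assumes RsaI cuts GTAC between T and A, leaving blunt
ends.

```python
def rsai_fragments(seq):
    """Fragment lengths after cutting every GTAC site between the T and the A."""
    cuts = []
    start = seq.find("GTAC")
    while start != -1:
        cuts.append(start + 2)  # cut falls after 'GT'
        start = seq.find("GTAC", start + 1)
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Hypothetical amplicon: one existing site, then a GAAC that the A to T change converts to GTAC.
normal = "C" * 94 + "GTAC" + "C" * 5 + "GAAC" + "C" * 65
mutant = normal[:104] + "T" + normal[105:]

print(rsai_fragments(normal))
print(rsai_fragments(mutant))
```

In this toy case the 76 bp fragment of the normal product is split into 9 bp and 67 bp pieces in the mutant product;
the text reports only the 67 bp fragment, and a 9 bp piece would presumably be too small to score on a typical gel.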

Discussion

[0137] The isolation and characterization of the NF2 gene, encoding the merlin protein, creates new possibilities
for accurate predictive testing in NF2. Until the NF2 gene was mapped to chromosome 22, no genetic testing was
possible. Once the gene was mapped, it became possible to predict NF2 carrier status using linked DNA markers (Narod,
S.A. et al., Am. J. Hum. Genet. 51:486-496 (1992)). Unfortunately, testing by linkage is limited by the availability of
multiple family members for comparison, by lack of complete informativeness of linked markers, and by the diminished
accuracy inherent in using a marker not located precisely within the disease gene. The precise identity of the NF2 gene
discovered by the inventors now makes it possible to overcome these diagnostic hurdles by defining the exact molecular
lesion associated with the disorder in any given pedigree.

[0138] Because mutations in NF2 were expected in general to inactivate the gene, they could have fallen into
several different categories. DNA rearrangements, such as deletions, inversions, insertions, or duplications could be
expected. Mutations that eliminate expression of the entire transcript, interfere with exon splicing, or disrupt its stability
could effectively inactivate the gene. Finally, DNA sequence alterations within the coding sequence that cause
premature termination of translation, yielding a truncated protein, or that result in a change in an amino acid residue critical
for normal function of the merlin protein were also likely.

[0139] In this application, as shown in the exemplary material, the SSCP technique was applied as a rapid means
to locate and confirm the probable molecular basis of NF2 in a large and well studied kindred (Wertelecki, W. et al., New
Engl. J. Med. 319:278-283 (1988)). The technique used strategically placed primer sets to amplify small regions of the
NF2 gene directly from genomic DNA using PCR. The findings were highly specific; none of the unaffected 158 control
samples produced an altered SSCP pattern. As would be predicted, all samples from affected members of the same
kindred produced the same altered SSCP pattern, reflecting the identical nature of the underlying molecular defect.
[0140] The sequence alteration in this family resulted in the substitution of a tyrosine residue for the asparagine at
position 220 of the merlin protein. This change affects one of the most conserved stretches in the merlin protein as
shown in Figure [...]. This particular asparagine residue and all surrounding residues are present in all three closely
[...] that this alteration is the cause of NF2 in this pedigree.

Example 6 

Analysis of merlin in Sporadic and Inherited Tumors 

[0141] To facilitate the search for mutations in NF2 and related tumors, the exon-intron junctions of the NF2
locus were sequenced, including the additional exon E16 that is alternatively spliced (Bianchi, A.B., et al.,
Nature Genet., in press, 1994; Haase, V.H., et al., Human Mol. Genet., in press, 1994). PCR assays were
developed for all seventeen exons, and were used to scan the entire gene for mutations in sporadic and
inherited vestibular schwannomas. The high proportion of tumors in which inactivating mutations were found
indicates that the NF2 gene plays a fundamental role in schwannoma tumorigenesis.

Materials and Methods 

Tissue samples 

[0142] Tumor specimens were obtained at the time of surgery and frozen for DNA analysis. Blood samples were
also obtained at the time of surgery to serve as normal tissue controls. High molecular weight DNA was
extracted from peripheral blood leukocytes and from frozen pulverized tumor tissue by SDS-proteinase K
digestion followed by phenol and chloroform extraction (Seizinger, B.R., et al., Nature 322:644-647 (1986)).

Design of primer pairs 

[0143] Exonic primers were designed within the NF2 coding sequence near the intron-exon borders as determined
by the results of exon trapping (Trofatter, J.A., et al., Cell 72:791-800 (1993)). For those regions not
isolated by trapping, primers were synthesized at intervals of approximately 100 base pairs. Using these
primers, intronic sequence was obtained by directly sequencing cosmids containing the gene using a cycle
sequencing kit (US Biochemical). Intronic primer pairs were then designed to amplify the splice donor and
acceptor sites as well as the exon itself. In the case of E12, it was necessary to construct two overlapping
primer sets to maintain a product length of less than 300 base pairs.

SSCP analysis 

[0144] SSCP analysis was performed according to the procedure of Orita et al. (Orita, M., et al., Genomics
5:874-879 (1989)) with minor modifications. Approximately 50 ng of genomic DNA was amplified using appropriate
intronic primer pairs (Table 2). Each 10 µl reaction contained 70 µM each of dATP, dCTP, dGTP and dTTP,
4 pmoles of each primer, 0.5 units Taq polymerase, 10 mM Tris pH 8.3, 1.5 mM MgCl2, 50 mM KCl, 0.01% gelatin,
and 0.1 µl α[33P]dATP (Amersham, 10 mCi/ml). For E1, a MgCl2 concentration of 0.5 mM was used. Amplification
was carried out for 30 cycles as follows: 94°C for 1 min., 55-60°C for 1 min., and 72°C for 1 min., after an
initial denaturation step at 94°C for 4 min. One µl of labeled amplified DNA was diluted into 9 µl of 0.1% SDS
and 10 mM EDTA, and an equal volume of loading dye (95% formamide, 0.5M EDTA, 0.05% bromphenol blue and 0.05%
xylene cyanol) was added. The samples were denatured for 2 min. at 90°C and separated on 6-8% polyacrylamide
gels containing 8% glycerol for 16 hrs. at 6-8 W. Gels were dried and exposed to Kodak X-OMAT film.
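Two quantities implied by the protocol above can be worked out with simple arithmetic: the final concentration of each primer and the programmed cycling time. The sketch below uses only values quoted in the text (10 µl reaction, 4 pmoles of primer, 30 cycles of three 1-minute steps, 4-minute initial denaturation); the function names are illustrative, and instrument ramp times are ignored.

```python
# Values quoted from the SSCP amplification protocol.
REACTION_VOLUME_UL = 10
PRIMER_PMOL = 4
CYCLES = 30
STEP_MINUTES = (1, 1, 1)          # denaturation, annealing, extension
INITIAL_DENATURATION_MIN = 4


def primer_concentration_um(pmol: float, volume_ul: float) -> float:
    """pmol per microliter is numerically equal to micromolar."""
    return pmol / volume_ul


def programmed_minutes(initial: int, cycles: int, steps: tuple) -> int:
    """Total programmed hold time, excluding temperature ramps."""
    return initial + cycles * sum(steps)


primer_um = primer_concentration_um(PRIMER_PMOL, REACTION_VOLUME_UL)
total_min = programmed_minutes(INITIAL_DENATURATION_MIN, CYCLES, STEP_MINUTES)

print(f"each primer: {primer_um} uM")
print(f"programmed time: {total_min} min")
```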

DNA sequencing 

[0145] For DNA sequencing, PCR amplifications were performed in 50 µl volumes as described above for SSCP
analysis, except that 200 µM dNTPs were added and the radioactive nucleotide was omitted. The product was
sequenced by one of two methods. In the first, the double-stranded product was used as a template to generate
single strands by priming multiple rounds of DNA synthesis with one of the oligonucleotides previously used in
the double-strand reaction (Gibbs, R.A., et al., Proc. Natl. Acad. Sci. USA 86:1919-1923 (1989)). Conditions
for asymmetric PCR amplification were identical except that only one primer was added. The product was ethanol
precipitated in the presence of ammonium acetate and resuspended in 10 µl H2O for subsequent sequence
analysis. The sequencing reactions were performed [...] DNA polymerase and the CircumVent Thermal Cycle kit
(New England BioLabs). Both strands of exons with mobility shifts were analyzed in all cases.

Loss of heterozygosity analysis

[0146] Genomic DNA was amplified using primer pairs for the polymorphic dinucleotide repeats at markers
D22S193 (Trofatter et al., in preparation) or D22S268 (Marineau, C., et al., Hum. Mol. Genet. 2:336 (1993)).
The CA strand primer was 5' end-labeled with polynucleotide kinase and gamma-32P ATP, and PCR was performed as
described previously (Louis, D.N., et al., Am. J. Pathol. 141:777-782 (1992)). In some cases, loss of
chromosome 22 alleles was determined by Southern blot analysis using probes at the following loci: D22S22,
D22S29, D22S28, D22S15, D22S1, CRYB2, D22S10 and D22S9 (Rouleau, G.A., et al., Genomics 4:1-6 (1989)).

Results

Exon Structure of the NF2 Gene 

[0147] The internal exons range in size from 45 base pairs to 218 base pairs, with an average of 111 base
pairs. Of the fifteen internal exons, ten (E3, E5-7, E9-11, E14-16) were isolated using exon amplification,
the technique described herein, which led to the isolation of the NF2 locus from cloned genomic DNA (see also
Trofatter, J.A., et al., Cell 72:791-800 (1993)). Table 1 shows the DNA sequences immediately surrounding the
intron-exon junctions, which all match the consensus for splice acceptor and donor sites. Additional intron
sequences on both 5' and 3' sides that were used to design primers for PCR amplification are described herein
in the section "Description of the Preferred Embodiments" (Figures 15A-Q herein).

PCR Assays for the Seventeen NF2 Gene Exons 

[0148] For each internal exon, primers were chosen in flanking intron sequences to develop an assay for PCR
amplification of the exon directly from genomic DNA. Because E12 is 218 base pairs long, two overlapping
primer sets were employed to yield products in a size range amenable to SSCP analysis: one set spanning the
5' intron-exon junction and the 5' region of the exon, and the other spanning the remainder of the exon and
the 3' exon-intron junction. In this way amplified fragments of a suitable length for SSCP analysis (140 and
284 base pairs, respectively) were generated. All of the coding region from E1 was amplified using one primer
in the 5' untranslated region (UTR) and one in intron 1. For E17, a primer from the final intron was paired
with a primer in the 3' UTR to amplify all of the E17 coding sequence (along with the first 100 base pairs of
3' UTR). All assays (except that for E1) were performed according to standardized amplification conditions. As
discussed in the exemplary material, E1 was amplified using a MgCl2 concentration of 0.5 mM rather than 1.5 mM
(to optimize the yield of PCR product). The annealing temperature was varied for optimum results. Table 2
lists the primers, annealing temperature and product size for each exon assay. In some cases (E7 and E8), an
initial set of primers was used but was subsequently replaced with a second primer pair, because the original
primer was either too far removed from the exon for convenient DNA sequencing or too close to the exon to
detect potential intronic mutations. In these instances both sets of primer pairs are listed.

Scanning for Mutations by SSCP Analysis of Blood-tumor Pairs

[0149] The exon PCR assays listed in Table 2 were used to scan the entire merlin coding sequence for mutations
in schwannomas. SSCP analysis was applied to DNA extracted from thirty-eight primary tumor specimens,
including eight vestibular schwannomas from NF2 patients (Table 3), twenty-seven sporadic vestibular
schwannomas (Table 4) and three sporadic spinal schwannomas (Table 4, S9, S25 and S27). DNA extracted from a
blood sample of the corresponding individual was used for comparison in each case. All tumor-blood pairs were
assayed for all exons.
[0150] Representative results of the SSCP analyses for three of the exons (E2, E10 and E14) are shown in
Figures 13A to 13C. For each assay, a lane of PCR product was run without denaturation (ND) to identify fully
reannealed, double-stranded DNA in the test lanes. The other bands in the test lanes represent various
conformations of the single-stranded DNAs in the product. For E2, the normal pattern is seen only in the blood
DNA (B) of S29. S11 and S33 display mobility shifts that are detected in both blood and tumor DNA. This result
indicates germline alterations. S29 displays a mobility shift only in the tumor DNA. This result indicates a
somatic mutation. For E10 and E14, all blood DNAs display the normal pattern and all tumor DNAs display
different mobility shifts. Both normal and altered PCR products were
[...] heterozygosity that could indicate deletion of one NF2 allele (Tables 3 and 4). General testing had
predicted that at least sixteen of the thirty-eight tumors had lost one NF2 allele. Thus, the entire NF2
coding sequence for 58-60 independent alleles was examined by SSCP. No obvious polymorphism affecting coding
or non-coding sequences was observed. This result indicates a remarkable degree of homogeneity for this gene
sequence in the population. In contrast, mutations in both NF2 and sporadic tumors were readily detected. DNA
differences from normal have been confirmed in twenty-seven of these tumors. Seven germline and twenty-five
somatic alterations have been identified by sequence analysis. The results are summarized in Tables 3 and 4.
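The blood-versus-tumor comparison used above to distinguish germline from somatic alterations amounts to a simple decision rule. The following sketch is illustrative only: the function name and the boolean encoding of "mobility shift present" are assumptions, and the three example observations are the E2 results described in the text for S11, S33 and S29.

```python
def classify_shift(blood_shifted: bool, tumor_shifted: bool) -> str:
    """Interpret an SSCP mobility shift seen in a blood-tumor DNA pair."""
    if blood_shifted and tumor_shifted:
        return "germline"      # alteration present in constitutional DNA
    if tumor_shifted:
        return "somatic"       # alteration arose in the tumor only
    if blood_shifted:
        return "unexpected"    # shift in blood but not tumor; recheck the assay
    return "normal"


# E2 observations as described in the text: (blood shifted, tumor shifted).
e2_observations = {
    "S11": (True, True),
    "S33": (True, True),
    "S29": (False, True),
}

e2_calls = {sample: classify_shift(*obs) for sample, obs in e2_observations.items()}
print(e2_calls)
```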

[0152] Germline mutations, present in both blood and tumor DNA, were delineated in five of the eight patients
with a confirmed diagnosis of NF2 (Table 3). These changes occurred in disparate locations and included
1) point mutations creating stop codons at residues 57 and 60 of E2 (S11 and S33) and 1396 of E13 (S4); 2) a
28 base pair deletion creating a frameshift and premature stop codon in E10 (S1); and 3) an insertion of one
base into a splice donor site in E12 (S32). Two other germline alterations were found in blood and tumor DNA
from individuals without a confirmed NF2 diagnosis (Table 4). These included 1) a single base change in the
intron upstream of E7 in a sporadic spinal schwannoma (S9) and 2) a substitution of Cys for Arg at residue 418
in a sporadic vestibular schwannoma (S44). Although the two intron changes and the apparent missense mutation
could conceivably represent polymorphisms, they were not found by SSCP analysis of 150 independent DNA samples
from normal individuals or individuals with other types of tumors.

[0153] Somatic mutations were observed in tumors from five of the eight patients with NF2. These included two
patients (S1 and S4) in whom a germline mutation was also detected (Table 3). In thirty sporadic tumors, a
total of twenty somatic mutations were found (Table 4). In three cases in which chromosome 22 heterozygosity
was maintained (S18, S24, and S29), two distinct somatic alterations were found in each tumor. For example,
tumor S29 displayed small deletions of one base pair and four base pairs in E2 and E8, respectively (Figures
14A and 14B). In one tumor (S35), a complex of two adjacent deletions was detected in a single allele. One
deletion removed five codons in-frame. The other deletion (beginning three base pairs downstream) caused a
frameshift.

[0154] The twenty-five somatic mutations from NF2 and sporadic tumors were found throughout the gene and were
associated with E1, E2, E3, E4, E7, E8, E9, E10, E12, E14 and E15. By far the most frequent lesions detected
(19/25) were small deletions of one to sixty-one base pairs that had either an obvious or presumed effect on
splicing or that produced frameshifts that led to truncated proteins of altered sequence. A single mutation
involved a frameshift resulting from a single base insertion (S42). The remaining five somatic changes were
point mutations that either altered splice donor (S22) or acceptor (S12, S37) sites, produced a stop codon at
residue 212 (S24), or generated a Met for Val substitution at residue 219 (S1).
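The point mutations named above can be related to the coding sequence by enumerating single-base substitutions under the standard genetic code. The sketch below is a hedged illustration, not the analysis used in the examples: the wild-type codons for residues 212 (CAG), 219 (GTG) and 220 (AAC) are read from the cDNA listing of SEQ ID NO:15 and should be treated as assumptions, and the function name is hypothetical.

```python
# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def single_base_routes(codon: str, target: str) -> list[str]:
    """List codons one substitution away from `codon` that encode `target`.

    `target` is a one-letter amino acid code, or "*" for a stop codon.
    """
    routes = []
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            if CODON_TABLE[mutant] == target:
                routes.append(mutant)
    return routes


stop_212 = single_base_routes("CAG", "*")   # Gln212 -> stop (S24)
met_219 = single_base_routes("GTG", "M")    # Val219 -> Met (S1)
tyr_220 = single_base_routes("AAC", "Y")    # Asn220 -> Tyr (kindred of Example 5)

print(stop_212, met_219, tyr_220)
```

In each of the three cases only one single-base route exists, so under these assumptions the amino acid change alone identifies the nucleotide change.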

Discussion 

[0155] The results of the studies herein defining specific mutations immediately allow improved prenatal,
presymptomatic and unaffected diagnosis for family members at risk. Highly accurate prenatal testing is now
possible by direct examination of fetal DNA for the presence of characteristic SSCP shifts, altered DNA
sequences, or gain or loss of restriction sites. Similarly, presymptomatic testing using these approaches
could eliminate the need for up to half of at-risk family members to undergo expensive and time-consuming
clinical monitoring, reducing the considerable financial and psychological burdens on these individuals. For
those who test positive for the specific changes, medical care should be improved by the clarification of
their status earlier in the course of their disease, which in turn might allow earlier consideration of
surgical intervention. With the delineation of a larger number of mutations, it now becomes possible to
discover whether the NF2 gene contains mutational "hot spots" which would simplify scanning.
[0156] NF2 is a disorder consistent with a "two-hit" model of tumorigenesis, in which homozygous inactivation
of a gene that normally suppresses tumor growth is the critical event in tumor formation (Knudson, A.G., Proc.
Natl. Acad. Sci. USA 68:820-823 (1971)). The same types of tumors that are present as multiple independent
growths in NF2 patients occur as sporadic, solitary cases in the general population (Martuza and Eldridge, New
Eng. J. Med. 318:684-688 (1988); Mulvihill, J.J., et al., Ann. Intern. Med. 113:39-52 (1990)). The familial
and sporadic tumors both display frequent loss of genetic material from chromosome 22, in a region to which
the NF2 gene defect has been mapped by linkage analysis (Kaiser-Kupfer, M.I., et al., Arch. Ophthalmol.
107:541-544 (1989); Rouleau, G.A., et al., Nature 329:246-248 (1987); Wertelecki, W., et al., New Engl. J.
Med. 319:278-283 (1988); Rouleau, G.A., et al., Am. J. Hum. Genet. 46:323-328 (1990); Narod, S.A., et al., Am.
J. Hum. Genet. 51:486-496 (1992); Seizinger, B.R., et al., Nature 322:644-647 (1986); Seizinger, B.R., et al.,
Science 236:317-319 (1987); Seizinger, B.R., et al., Proc. Natl. Acad. Sci.
[...] affected in this way as the vast majority of tumors seen in NF2 are schwannomas, and particularly
vestibular schwannomas and meningiomas.

[0157] The results herein implicate merlin as a tumor suppressor. Whereas germline mutations are present in
both blood and tumor DNA from NF2 patients, somatic mutation of merlin is a frequent event in schwannomas. In
several cases, two inactivating mutations were detected in the same tumor. In many others, a single mutant
allele remained following the loss of the second copy of the locus. The alterations occurred throughout the
NF2 gene. Most exons displayed at least one mutation.

[0158] One alteration, conversion of the Arg codon at position 57 to a stop codon, has been seen twice before
in independent NF2 patients (Rouleau, G.A., et al., Nature 363:515-521 (1993)), suggesting that this site,
which contains a CpG dinucleotide, may be particularly prone to C → T transitions. The presence of this change
in the blood DNA of S11, combined with its absence from either parent, demonstrates that this is a case of a
new mutation to NF2.
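The susceptibility of codon 57 can be illustrated by scanning a stretch of coding sequence for CpG dinucleotides at which a C → T transition on the coding strand creates a stop codon. This is a minimal sketch under stated assumptions: the segment (codons 54-60, CTG GGG CTC CGA GAA ACC TGG) is read from the cDNA listing of SEQ ID NO:15, the function name is hypothetical, and only coding-strand C → T changes are modeled (the complementary-strand transition, which appears as G → A, is not considered).

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

# Codons 54-60 of the merlin coding sequence as read from the cDNA listing.
SEGMENT = "CTGGGGCTCCGAGAAACCTGG"
FIRST_CODON = 54


def cpg_nonsense_sites(seq: str, first_codon: int) -> list[tuple[int, str, str]]:
    """Find CpG sites where C -> T turns the containing codon into a stop.

    Returns (codon number, wild-type codon, mutant codon) for each hit.
    `seq` must start at the first base of codon `first_codon`.
    """
    hits = []
    for i in range(len(seq) - 1):
        if seq[i:i + 2] != "CG":
            continue
        start = i - i % 3                      # first base of the codon holding the C
        codon = seq[start:start + 3]
        if len(codon) < 3:
            continue
        offset = i - start
        mutant = codon[:offset] + "T" + codon[offset + 1:]
        if mutant in STOP_CODONS:
            hits.append((first_codon + start // 3, codon, mutant))
    return hits


hits = cpg_nonsense_sites(SEGMENT, FIRST_CODON)
print(hits)
```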
[0159] Two new missense mutations were identified, Val219Met and Arg418Cys, which may target these residues
as particularly important in merlin's tumor suppressor function. The Val at position 219, one residue away
from the previously reported Asn220Tyr mutation, is located within the protein 4.1 domain that is
characteristic of this family and is conserved in human moesin, radixin, and protein 4.1, but changed to Ile
in human ezrin. The Arg at position 418 is located in the long α-helical domain that comprises most of the
C-terminal half of merlin and its relatives but is not strictly conserved in the other human members of this
protein family.

[0160] A surprising number of somatic mutations involve changes in intron sequences. These may occur at a
distance from the exon-intron junction. The absence of these alterations in blood-derived DNA of the same
individuals, and the failure to detect the same change in any other individuals, indicate that these mutations
are de novo events associated with tumor formation.

[0161] In addition to expanding the number and variety of germline NF2 mutations described, the examples
herein suggest that germline alterations occur in patients not yet diagnosed clinically with NF2. One patient,
with a single spinal schwannoma, displayed a single base alteration in an intron sequence. Another, with a
single vestibular schwannoma, displayed the Arg418Cys change described above. Careful clinical follow-up is
indicated for these patients. It is possible that they represent a class of individuals with mutations that
only mildly affect normal merlin function, and consequently do not produce the full NF2 phenotype.

[0162] The development of reliable PCR assays for each exon of the NF2 gene should greatly facilitate the
cataloging of mutations in NF2 patients and their tumors by genomic scanning. A detailed mutational analysis
of the NF2 gene can be expected to result, identifying sites particularly prone to alteration, pinpointing
amino acid residues crucial for normal function, and providing a basis for relating specific alterations to
variations in phenotype. Perhaps most important, however, the ability to rapidly scan the NF2 gene for
mutations will accelerate the assessment of a role for merlin in other tumor types.

Table 1

Intron-Exon Boundaries of the NF2 Gene

[...]                                         ?9     AGG     516     GTAAGAGATT
 6     ?????GGTAG      517     GTA            83     CAG     599     GTGAGGCCCA
 7     CTCCCCACAG      600     GGA            76     CGG     675     GTGTGTTGAA
 8     GGATCCACAG      676     AAT           135     GAG     810     GTAGGACATG
 9     ATT???CCAG      811     ?              75     CTG     885     GTAAGTTGAG
10     GTGGCCACAG      886     ATT           114     CAG     999     GTGAGCACAA
11     CCCCTCGCAG     1000     ATG           123     CTG    1122     GTGATTTCTG
12     TGCCCTCCAG     1123     ATG           218     GAG    1340     GTGAGGGGGC
13     TTCCTTGCAG     1341     GGC           106     CCG    1446     GTGAGCCTGG
14     TCATTAACAG     1447     ?             128     ?AA    1574     GTATGTAGCC
15     T?GCCGGCAG     ?        ?             163     AAG    1737     GTACCCAGGC
16     ?CTGGTTTAG     ?        ?             ?       AAA    1782     GTAGGTTGTT
17     TTTCTTACAG     ?        CTC           [...]

* Base pair numbering is based on +1 being the A of the initiator ATG.

SEQUENCE LISTING



(1) GENERAL INFORMATION:

     (i) APPLICANT: THE GENERAL HOSPITAL CORPORATION
                    Fruit Street
                    Boston, Massachusetts 02114
                    United States of America

         INVENTORS: Trofatter, James A.
                    MacCollin, Mia M.
                    Gusella, James F.

    (ii) TITLE OF INVENTION: Tumor Suppressor Gene Merlin and Uses Thereof

   (iii) NUMBER OF SEQUENCES: 118

    (iv) AGENTS ADDRESS:
          (A) ADDRESSEE: Kilburn & Strode
          (B) STREET: 30 John Street
          (C) CITY: LONDON
          (D) COUNTRY: England
          (E) POSTAL CODE: WC1N 2DD

     (v) COMPUTER READABLE FORM:
          (A) MEDIUM TYPE: Floppy disk
          (B) COMPUTER: IBM PC compatible
          (C) OPERATING SYSTEM: PC-DOS/MS-DOS
          (D) SOFTWARE: PatentIn Release #1.0, Version #1.25

    (vi) CURRENT APPLICATION DATA:
          (A) FILING DATE: 25th February 1994

   (vii) PRIOR APPLICATION DATA:
          (A) APPLICATION NUMBER: US (to be assigned)
          (B) FILING DATE: 22-DEC-1993

   (vii) PRIOR APPLICATION DATA:
          (A) APPLICATION NUMBER: US 08/108,803
          (B) FILING DATE: 19-AUG-1993

   (vii) PRIOR APPLICATION DATA:
          (A) APPLICATION NUMBER: US 08/022,034
          (B) FILING DATE: 25-FEB-1993

   (vii) PRIOR APPLICATION DATA:
          (A) APPLICATION NUMBER: US 08/026,053
          (B) FILING DATE: 04-MAR-1993


(2) INFORMATION FOR SEQ ID NO:1:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 21 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

CAGATTGTTC ATTCCAAGTG G                                              21

(2) INFORMATION FOR SEQ ID NO:2:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 20 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

ACCCTGAGCA ATCCACTACC                                                20

(2) INFORMATION FOR SEQ ID NO:3:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 20 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

TGCACACACA TCCTTTTCAC                                                20

(2) INFORMATION FOR SEQ ID NO:4:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 23 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

GAGAGAGACT CCTGTCTCAA AAA                                            23

(2) INFORMATION FOR SEQ ID NO:5:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 19 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

AGGAGGCTGA ACGCACGAG                                                 19

(2) INFORMATION FOR SEQ ID NO:6:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 20 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

TGGTATTGTG CTT??TGCTG                                                20

(2) INFORMATION FOR SEQ ID NO:7:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 21 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

CTTCAACCTG ATTGGTGACA G                                              21

(2) INFORMATION FOR SEQ ID NO:8:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 20 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

TGGTATTGTG CTTGCTGGTG                                                20

(2) INFORMATION FOR SEQ ID NO:9:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

AGGTACTGGA TCATGATGTT TC                                             22

(2) INFORMATION FOR SEQ ID NO:10:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 21 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

TTTGGAAGCA ATTCCTCTTG G                                              21

(2) INFORMATION FOR SEQ ID NO:11:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 21 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

?CAGCCAGCT CCCTATGGAT ?                                              21

(2) INFORMATION FOR SEQ ID NO:12:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

AGCTGAAATG GAATATCTGA AG                                             22

(2) INFORMATION FOR SEQ ID NO:13:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 20 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

GCCTTCTCCT CCCTGGCCTG                                                20

(2) INFORMATION FOR SEQ ID NO:14:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 22 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:

GATGGAGTTC AATTGCGACA TG                                             22

(2) INFORMATION FOR SEQ ID NO:15:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 2257 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (ix) FEATURE:
          (A) NAME/KEY: CDS
          (B) LOCATION: 220..2004

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:

ACGGCAGCCG TCAGGGACCG TCCCCCAACT rcCCTTTCCG C7CAGGGAGG 3TCC7CGCGG €0 

rcCATGCTGG CCGCTGGGGA CCC3CGCAGC CCAGACCGTT C CCGGGCCGG CCAGCCGGCA 120 

CCATGGTGGC CCTGAGGCCT GTGCAGCAAC TCOAGGGGGG ^TAAAGGGCT CAGAGTGCAG 180 

GCCGTGGGGC GCGAGGG7CC CGGGCCTGAG CCCCGC3CC ATG GCC GGG GCC ATC 234 

Met Ala Gly Ala He 

1 5 

GCT TGC CGC ATG AGC 77C AGC 7CT CTC AAG AGG AAG CAA CZZ AAG ACG 2B2 
A. a Ser Arg Mec Ser Fhe Ser Ser leu Lys Arg Lvs Gin Pro Lys Thr 

10 IS ' 23 
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TTC ACC GTG AGG ATC GTC ACC ATG GAC CCC GAG AT 3 GAG TTC AAT TGC 3 30 

Phe Thr Val Arg lie Val Thr Met Asp Ala Glu Me: Glu Phe Asn Cyo 

25 3C 35 

GAG ATG AAG TGG AAA GGG AAG GAC GTC TTT GAT TT3 GTG TGC CCG ACT 3 78 

Glu Met Lys Trp Ly» Gly Lys Asp Leu Phe Asp Leu Val Cys Arg Thr 

40 45 50 

CTG GGG CTC CGA GAA ACC TGG TTC TTT GGA CTG CAG TAC ACA ATC AAG 4 26 

Leu Gly Leu Arg Glu Thr Trp Phe Phe Gly Leu Gin Tyr Thr lie LyB 

55 60 &=j 



GAC ACA GTG GCC TGG CTC AAA ATG GAC AAG AAG GTA CTG GAT CAT GAT 4 74 

Asp Thr Val Ala Trp Leu Lys Met Asp Lys Lvs Val Leu Asp Hie Asp 
70 75 SO 85 

:S GTT TCA AAG GAA GAA CCA GTC ACC TTT CAC TTC TTC GCC AAA TTT TAT 522 

Val Ser Lys Glu Glu Pro Val Thr Phe Hie Phe Leu Ala Lys Phe Tyr 

9 0 ?5 100 

CCT GAG AAT GCT GAA GAG GAG CTG GTT CAG GAG ATC ACA CAA CAT TTA 5^0 

Pro Glu Asn Ala Glu Glu Glu Leu Val Gin Glu Ha Thr Gin His Leu 
10S 110 115 

20 

TTC TTC TTA CAG GTA AAC, AAG CAG ATT TTA GAT CAA AAG ATC TAC TGC SIS 

Phe Phe Leu Gin Val Lvs Lyts Gin lie Leu As? Glu Lys lie Tyr Cys 
120 1 2 S 130 

CCT CCT GAG GCT TCT GTG CTC CTG GCT TCT TAG GCC GTC CAG GCC AAG 5 66 

Pre Pro Glu Ala ser Val Leu Leu Ala Ser Tyr Ala Val Gin Ala Lye 

25 135 140 145 

TAT GGT GAC TAC GAC CCC AGT GTT CAC AAG CGG GGA TTT CTG GCC CAA 714 

Tyr Gly Asd Tyr Asp Pro Ser Val His Lys Arg Gly Phe Leu Ala Gin 

150 * 155 ISC 155 

GAG GAA TTG CTT CCA AAA AGG GTA ATA AAT CTG TAT CAG ATG ACT CCG 762 

G^u Glu Lau Leu Pro Lvs Ara Val He Asn Leu Tyr Gin Met Thr Pro 
30 ' 17Q " 175 160 

GAA ATG TGG GAG GAG ACA ATT ACT GCT TGG TAC CCA GAG CAC CGA GGC 810 

Glu Met Trp Glu Glu Ara He Thr Ala Trc Tyr Ala Glu His Arg Gly 
185 150 195 



35 
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CGA GCC AGG GAT GAA GCT GAA ATG GAA TAT CTG AAG ATA GCT CAG GAC 3 5S 

\rg Ala Arg Asd Glu Ala Glu Met Glu Tyr Leu Lys He Ala Gin Asp 

20Q " 205 210 

CTG GAG ATG TAC GGT GTG AAC TAC TTT GCA ATC CGG AAT AAA AAG GGC 9 06 

Leu Glu Met Tvr Gly Val Asn Tvr Phe Ala H <■? Arc Asn Lys Lys Gly 
215 " 223 225 

ACA GAG CTG CTG CTT GGA GTG GAT GCC CTG GGG CTT CAC ATT TAT GAC 9 34 

Thr Glu Leu Leu Leu Gly Val Aap Ala Leu Gly Leu His lie Tyr Asp 

230 235 140 245 

CCT GAG AAC AGA CTG ACC CCC AAG ATC TTC TTC CCG TGG .AAT GAA ATC 1Q32 

Pro Glu Asn Arg Leu Thr Pro Lys He Ser Phe .-r=r Trp Asn Glu He 

25C 155 250 

CGA AAC ATC TCG TAC AGT GAC AAG GAG TTT ACT AT" .AAA CCA CTG GAT 1050 

Arc Asn He Ser Tyr Ser Asd Lys Glu Phe Thr He Lys Pro Leu Asp 

2SS ' 270 275 

AAG AAA ATT GAT GTC TTC AAG TTT AAC TCC TCA AAG CTT CGT GTT AAT 1098 

Lys Lys lie Asc Val Phe Lys Phe Asn Ser Ser Lys Leu Arg Vai Asn 

280 285 2?: 

AAG CTG ATT CTC CAG CTA T3T ATC GGG AAC CAT GAT CTA TTT ATG AGG 114 6 

L/c Leu He Leu Glr. Leu Cys He Gly A^n -19 As? Leu Phe Met Arg 

295 300 -C5 
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AGA AGG AAA GCC GAT TCT TTG GAA GTT CAG CAC ATG AAA GCC CAG GCC 
Arg Arg Lys Ala Asp Ser Leu Glu Val Gin Gin Met Lys Ala Gin Ala 
310 315 320 ' 325 



1C 



AGG GAG GAG AAG GOT AGA AAG CAG ATG GAG CGG CAG CGC CTC GCT ^GA 
Arg Glu Glu Lys Ala Arg Lya Gin Met Glu Arg Gin Arg Leu Ala Aro 
330 335 340 

GAG AAG CAG ATG AGG GAG GAG GCT GAA CGC ACG AGG GAT GAG TTG GAG 
GlU L y* Gia M * c Ar 9 G * u Glu Ala Glu Arg Thr Arg Asp Glu Leu Glu 
345 35Q 355 



1194 



124: 



129C 



AGG AGG CTG CTG CAG ATG AAA GAA GAA GCA ACA ATG GCC AAC GAA GCA 
Arg Arg Leu Leu Gin Met Lya Glu Glu Ala Thr Met Ala Asn Glu Ala 
360 365 370 

CTG ATG CGG TCT GAG GAG ACA GCT GAC CTG TTG GCT GAA AAG GCC CAG 

L«u Met Arg Ser Glu Glu Thr Ala Aso Leu Leu Ala Glu Lys Ala Gin 
3? S 380 * 385 



1338 



1386 
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ATC ACC GAG GAG GAG GCA AAA CTT CTG GCC CAG AAG GCC GCA GAG GCT 
lie Thr Glu Glu Glu Ala Lys Leu Leu Ala Gin Lys Ala Ala Glu Ala 
390 395 400 405 

GAG CAG GAA ATG CAG CGC ATC AAG GCC ACA GCG ATT CGC ACG GAG GAG 
Glu Gm Glu Met Gin Arg lie Lys Ala Thr Ala He Arg Thr Glu Glu 
410 415 420 



505 



515 



-CC ~TG GAG ATA GAG AAA GAA AAA GTG GAA TAC ATG GAA AAG AGC AAG 
jcr Met GiU 3iu Lys Glu Lys Val Glu Tvr Met Glu Lys Ser Lys 

"0 525 53 C 



1434 



1482 



« 5 AG AAG CGC CTG A ? G GAG CAG AAG GTG CTG GAA GCC GAG GTG CTG GCA ^ 53 0 

G - u L Y* Ar 9 - eu Mec Glu Gin Lys Val Leu Glu Ala Glu Val Leu Ala 
425 430 435 

CTG AAG ATG GCT GAG GAG TCA GAG AGG AGG GCC AAA GAG GCA GAT CAG 1578 

Leu Lys Met Ala Glu Glu Ser Glu Arg Arg Ala Lys Glu Ala Asp Gin 

440 445 4So 



... AAG CAG GAC CTG CAG GAA GCA CGC CAG GCG GAG CGA AGA GCC AAG 16 25 

- su L Y C Gln As P T -*u Gin Glu Ala Arg Glu Ala Glu Arc Arg Ala Lye 
45 = 460 4b5 

CAG AAG CTC CTG GAG ATT GCC ACC AAG CCC ACG TAC CCG CCC ATG AAC 167^ 

3*a Lys Leu Leu Glu He Ala Thr Lys Pro Thr Tyr Pro Pro Met Asn 

470 475 460 * 485 

:CA ATT CCA GCA CGG TTG CCT CCT GAC ATA CCA AGC TTC AAC CTC A^T 172 2 

Pro lie Pro Ala Pro Leu Pro Pro Aso He Pro ?er Phe Asn Leu lie 
430 495 5qo 

GGT GAC AGC CTG TCT TTG GAC TTC AAA GAT ACT GAC ATC AAG CGG CTT 17 7C 

Hy Asp Ser Leu £*r Phe Asp Phe Lys Asp Thr Asp Met Lys Arg Leu 



1313 



45 ~ AT T 70 CAG GAG ~ AG c ^ AAT GAA CTC AAG ACA GAA ATC GAG GCC TTC 13 66 

.-.13 Leu Gin Glu Gin Leu Asn Glu Leu Lvs Th- G'u '1- Glu Ala Leu 

S35 54D ' 545 

AAA CTG AAA GAG AGG GAG ACA GCT CTC CAT ATT CTG CAC AAT GAG AAC i9i- 

-ys -au Lys Glu Arg Glu Thr Ala Leu Aso He Leu His Asp. Glu Asn 

55 ° 555 * 560 565 

TCC GAC AGG GGT GGC AGC AGC AAG CAC AAT ACC ATT .AAA AAG CTC ACC 19 5 2 

3er A3 P A *'9 Gly Gly Ser Ser Lye His Asn Thr He Lys Lvs Leu Thr 

570 575 ' 
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TTG CAG AGC GCC AAG TCC CGA GTG GCC TTC TTT GAA GAG CTC 
Leu Gin Ser Ala Lya S»r Arg Val Ala Phe Phe Glu Glu Leu 
535 590 S95 



2004 



TAGCAGGTGA CCCAGCCACC CCAGGACC7G CCACTTCTCC TOCTACCGGG ACCGCGCGAT 2 0 64 

GGACCAGATA TCAAGAGAGC CAT CCA TAG G GAGCTGGCTG GCGGTTTCCG TGGGAGCTCC 212 4 

AGAACTTTCC CCAGCTGAGT CAACACCCCA GCCCCTCTTA TGTCCAATTC CCTTGAACTA 2184 

CGACCCTGTA GAGATTTCTC TCATGGCGTT CTAGTTCTCT GACCTGAGTC TTTCTTTTAA 224 4 
CAAGTATTTG TCT 



(2) INFORMATION FOR SEQ ID NO:16:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 595 amino acids
          (B) TYPE: amino acid
          (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:

Met Ala Gly Ala Ile Ala Ser Arg Met Ser Phe Ser Ser Leu Lys Arg
1 5 10 15

Lys Gln Pro Lys Thr Phe Thr Val Arg Ile Val Thr Met Asp Ala Glu
20 25 30

Met Glu Phe Asn Cys Glu Met Lys Trp Lys Gly Lys Asp Leu Phe Asp
35 40 45

Leu Val Cys Arg Thr Leu Gly Leu Arg Glu Thr Trp Phe Phe Gly Leu
50 55 60

Gln Tyr Thr Ile Lys Asp Thr Val Ala Trp Leu Lys Met Asp Lys Lys
65 70 75 80

Val Leu Asp His Asp Val Ser Lys Glu Glu Pro Val Thr Phe His Phe
85 90 95

Leu Ala Lys Phe Tyr Pro Glu Asn Ala Glu Glu Glu Leu Val Gln Glu
100 105 110

Ile Thr Gln His Leu Phe Phe Leu Gln Val Lys Lys Gln Ile Leu Asp
115 120 125

Glu Lys Ile Tyr Cys Pro Pro Glu Ala Ser Val Leu Leu Ala Ser Tyr
130 135 140

Ala Val Gln Ala Lys Tyr Gly Asp Tyr Asp Pro Ser Val His Lys Arg
145 150 155 160

Gly Phe Leu Ala Gln Glu Glu Leu Leu Pro Lys Arg Val Ile Asn Leu
165 170 175

Tyr Gln Met Thr Pro Glu Met Trp Glu Glu Arg Ile Thr Ala Trp Tyr
180 185 190

Ala Glu His Arg Gly Arg Ala Arg Asp Glu Ala Glu Met Glu Tyr Leu
195 200 205

Lys Ile Ala Gln Asp Leu Glu Met Tyr Gly Val Asn Tyr Phe Ala Ile
210 215 220

Arg Asn Lys Lys Gly Thr Glu Leu Leu Leu Gly Val Asp Ala Leu Gly
225 230 235 240

Leu His Ile Tyr Asp Pro Glu Asn Arg Leu Thr Pro Lys Ile Ser Phe
245 250 255

Pro Trp Asn Glu Ile Arg Asn Ile Ser Tyr Ser Asp Lys Glu Phe Thr
260 265 270

Ile Lys Pro Leu Asp Lys Lys Ile Asp Val Phe Lys Phe Asn Ser Ser
275 280 285

Lys Leu Arg Val Asn Lys Leu Ile Leu Gln Leu Cys Ile Gly Asn His
290 295 300

Asp Leu Phe Met Arg Arg Arg Lys Ala Asp Ser Leu Glu Val Gln Gln
305 310 315 320

Met Lys Ala Gln Ala Arg Glu Glu Lys Ala Arg Lys Gln Met Glu Arg
325 330 335

Gln Arg Leu Ala Arg Glu Lys Gln Met Arg Glu Glu Ala Glu Arg Thr
340 345 350

Arg Asp Glu Leu Glu Arg Arg Leu Leu Gln Met Lys Glu Glu Ala Thr
355 360 365

Met Ala Asn Glu Ala Leu Met Arg Ser Glu Glu Thr Ala Asp Leu Leu
370 375 380

Ala Glu Lys Ala Gln Ile Thr Glu Glu Glu Ala Lys Leu Leu Ala Gln
385 390 395 400

Lys Ala Ala Glu Ala Glu Gln Glu Met Gln Arg Ile Lys Ala Thr Ala
405 410 415

Ile Arg Thr Glu Glu Glu Lys Arg Leu Met Glu Gln Lys Val Leu Glu
420 425 430

Ala Glu Val Leu Ala Leu Lys Met Ala Glu Glu Ser Glu Arg Arg Ala
435 440 445

Lys Glu Ala Asp Gln Leu Lys Gln Asp Leu Gln Glu Ala Arg Glu Ala
450 455 460

Glu Arg Arg Ala Lys Gln Lys Leu Leu Glu Ile Ala Thr Lys Pro Thr
465 470 475 480

Tyr Pro Pro Met Asn Pro Ile Pro Ala Pro Leu Pro Pro Asp Ile Pro
485 490 495

Ser Phe Asn Leu Ile Gly Asp Ser Leu Ser Phe Asp Phe Lys Asp Thr
500 505 510

Asp Met Lys Arg Leu Ser Met Glu Ile Glu Lys Glu Lys Val Glu Tyr
515 520 525

Met Glu Lys Ser Lys His Leu Gln Glu Gln Leu Asn Glu Leu Lys Thr
530 535 540

Glu Ile Glu Ala Leu Lys Leu Lys Glu Arg Glu Thr Ala Leu Asp Ile
545 550 555 560

Leu His Asn Glu Asn Ser Asp Arg Gly Gly Ser Ser Lys His Asn Thr
565 570 575

Ile Lys Lys Leu Thr Leu Gln Ser Ala Lys Ser Arg Val Ala Phe Phe
580 585 590

Glu Glu Leu
595



(2) INFORMATION FOR SEQ ID NO:17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:

CCATCTCACT TAGCTCCAAT G 21 
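
Each entry in this listing declares a LENGTH and then gives the sequence in blocks of ten bases followed by a running count. The following is a minimal sketch, not part of the patent, of how a reader might check that the declared length agrees with the printed sequence. The function names `count_bases` and `length_matches` are hypothetical and chosen only for illustration.

```python
def count_bases(sequence_line: str) -> int:
    """Count the letters in one listing line.

    Spaces between ten-base blocks and the trailing running count
    (digits) are ignored; only alphabetic characters are counted.
    """
    return sum(1 for ch in sequence_line if ch.isalpha())


def length_matches(declared: int, sequence_line: str) -> bool:
    """Return True when the declared LENGTH equals the number of bases printed."""
    return count_bases(sequence_line) == declared


# SEQ ID NO:17 as printed above: declared LENGTH is 21 base pairs.
print(length_matches(21, "CCATCTCACT TAGCTCCAAT G 21"))
```

A check of this kind only confirms internal consistency of one entry; it cannot tell whether an individual base was misread.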

(2) INFORMATION FOR SEQ ID NO:18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:

CTCACTCACT CTCTGTCTAC 20

(2) INFORMATION FOR SEQ ID NO:19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:

GTAACCGGCC

(2) INFORMATION FOR SEQ ID NO:20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:

GTTATTGCAG

(2) INFORMATION FOR SEQ ID NO:21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:

GTTGGGCTAG

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

AATTCTCCAG

(2) INFORMATION FOR SEQ ID NO:23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:

GTACATCAGT

(2) INFORMATION FOR SEQ ID NO:24:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:

CTCCTTTCAG

(2) INFORMATION FOR SEQ ID NO:25:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:

GTAGGCTCAA

(2) INFORMATION FOR SEQ ID NO:26:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:

TTCTTTCCAG

(2) INFORMATION FOR SEQ ID NO:27:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

GTAAGAGATT 10

(2) INFORMATION FOR SEQ ID NO:28:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:

TTTTTGGTAG 10

(2) INFORMATION FOR SEQ ID NO:29:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:

GTGAGGCCCA 10

(2) INFORMATION FOR SEQ ID NO:30:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:

CTCCCCACAG 10

(2) INFORMATION FOR SEQ ID NO:31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:

GTGTGTTGAA 10

(2) INFORMATION FOR SEQ ID NO:32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:

GGATCCACAG

(2) INFORMATION FOR SEQ ID NO:33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:

GTAGGACATG 10

(2) INFORMATION FOR SEQ ID NO:34:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:

ATTCTTCCAG 10

(2) INFORMATION FOR SEQ ID NO:35:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35:

GTAAGTTGAG 10

(2) INFORMATION FOR SEQ ID NO:36:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36:

GTGGCCACAG 10

(2) INFORMATION FOR SEQ ID NO:37:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:

GTGAGCACAA 10

(2) INFORMATION FOR SEQ ID NO:38:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:

CCCCTCGCAG

(2) INFORMATION FOR SEQ ID NO:39:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39:

GTGATTTCTG 10

(2) INFORMATION FOR SEQ ID NO:40:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40:

TGCCCTCCAG

(2) INFORMATION FOR SEQ ID NO:41:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:

GTGAGGGGGC

(2) INFORMATION FOR SEQ ID NO:42:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:

TTCCTTGCAG

(2) INFORMATION FOR SEQ ID NO:43:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:

GTGAGCCTCG 10

(2) INFORMATION FOR SEQ ID NO:44:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:

TCATTAACAG 10

(2) INFORMATION FOR SEQ ID NO:45:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45:

GTATGTAGCC 10

(2) INFORMATION FOR SEQ ID NO:46:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46:

(2) INFORMATION FOR SEQ ID NO:47:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:

GTACCCAGGG

(2) INFORMATION FOR SEQ ID NO:48:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

GCTGGTTTAG 10

(2) INFORMATION FOR SEQ ID NO:49:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49:

GTAGGTTGTT 10

(2) INFORMATION FOR SEQ ID NO:50:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 10 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50:

TTTCTTACAG 10

(2) INFORMATION FOR SEQ ID NO:51:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51:

GCTAAAGGGC TCAGAGTGCA G 21
(2) INFORMATION FOR SEQ ID NO:52:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:

GAGAACCTCT CGAGCTTCCA C 21

(2) INFORMATION FOR SEQ ID NO:53:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53:

TGTCCTTCCC CATTGGTTTG 20

(2) INFORMATION FOR SEQ ID NO:54:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54:

CAGTTTCATC GAGTTCTAGC C 21

(2) INFORMATION FOR SEQ ID NO:55:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 28 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55:

AGTGCAGAGA AAAGGTTTTA TTAATGAT 28

(2) INFORMATION FOR SEQ ID NO:56:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:

TGGAAAGCTC ACGTCAGCC 19

(2) INFORMATION FOR SEQ ID NO:57:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:

GCTTCTTTGA GGGTAGCACA 20

(2) INFORMATION FOR SEQ ID NO:58:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear



EP 0 613 945 B1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:

GCTCAACTCT GAGGCCAACT 

(2) INFORMATION FOR SEQ ID NO:59:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:

CCTCACTTCC CCTCACAGAG 20

(2) INFORMATION FOR SEQ ID NO:60:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:

CCCATGACCC AAATTAACGC 20

(2) INFORMATION FOR SEQ ID NO:61:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:

GCTCTCCCTT TCTTCTTTCC 20

(2) INFORMATION FOR SEQ ID NO:62:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:

TCCTTCAAGT CCTTTGGTTA GC 22

(2) INFORMATION FOR SEQ ID NO:63:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:

(2) INFORMATION FOR SEQ ID NO:64:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:

TTAGACCACA TATCTGCTAT G
(2) INFORMATION FOR SEQ ID NO:65:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:

CATGTGTAGG TTTTTTATTT TGC

(2) INFORMATION FOR SEQ ID NO:66:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:

GCCCATAAAG GAATGTAAAC C

(2) INFORMATION FOR SEQ ID NO:67:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:

CCATCTCACT TAGCTCCAAT G

(2) INFORMATION FOR SEQ ID NO:68:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:

CTCACTCACT CTCTGTCTAC 20

(2) INFORMATION FOR SEQ ID NO:69:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:

CAATCCTTGA TTTGGTGCCC

(2) INFORMATION FOR SEQ ID NO:70:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:

GAGGTTTCAA CACACCCGGA

(2) INFORMATION FOR SEQ ID NO:71:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:

GAAGGTTGAA TAAAATTTTG AGCCTC

(2) INFORMATION FOR SEQ ID NO:72:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 22 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:

GACAGGGAAA GATCTGCTGG ACC

(2) INFORMATION FOR SEQ ID NO:73:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:

CTGTTCTTAT TGGATCCACA G

(2) INFORMATION FOR SEQ ID NO:74:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:

AACAACCACA CCCTCAAAGC 20

(2) INFORMATION FOR SEQ ID NO:75:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:

GACTTGGTGC TCCTAATTCC C 21

(2) INFORMATION FOR SEQ ID NO:76:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:

CCATTATCAG TAATGAAAAC CAGG 24

(2) INFORMATION FOR SEQ ID NO:77:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:

TGCTACCTGC AAGAGCTCAA 20

(2) INFORMATION FOR SEQ ID NO:78:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:

CTGACCACAC AGTGACATC 19

(2) INFORMATION FOR SEQ ID NO:79:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:

TCTTTGGCCC TTGTGGCAC
(2) INFORMATION FOR SEQ ID NO:80:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:

CAGGAGACCA AGCTCCAGAA

(2) INFORMATION FOR SEQ ID NO:81:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:

TTCAGCTAAG ACCACTGTCC 20

(2) INFORMATION FOR SEQ ID NO:82:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:

CGCTGCATTT CCTGCTCAG 19
(2) INFORMATION FOR SEQ ID NO:83:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:

GCTGAAAAGG CCCAGATCA 19

(2) INFORMATION FOR SEQ ID NO:84:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:

CTTGAGGACA ACTGCTCTAG 20

(2) INFORMATION FOR SEQ ID NO:85:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:

GGTGTCTTTT CCTGCTACCT

(2) INFORMATION FOR SEQ ID NO:86:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:

GGGAGGAAAG AGAACATCAC 20

(2) INFORMATION FOR SEQ ID NO:87:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:

TGTGCCATTG CCTCTGTG 18

(2) INFORMATION FOR SEQ ID NO:88:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 18 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:

AGGGCACAGG GGGCTACA

(2) INFORMATION FOR SEQ ID NO:89:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:

TGGCCAAGTA GAGACGTGA

(2) INFORMATION FOR SEQ ID NO:90:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:

TACAAGAAAG AGACCCTGGG

(2) INFORMATION FOR SEQ ID NO:91:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:91:

TCTGCCCAAG CCCTGATGC

(2) INFORMATION FOR SEQ ID NO:92:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:

TGGTCCTGAT CAGCAAAATA C

(2) INFORMATION FOR SEQ ID NO:93:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:

GGCATTGTTG ATATCACAGG G 21

(2) INFORMATION FOR SEQ ID NO:94:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:

GGCAGCACCA TCACCACATA 20

(2) INFORMATION FOR SEQ ID NO:95:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:95:

CTCTCAGCTT CTTCTCTGCT

(2) INFORMATION FOR SEQ ID NO:96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:

CCAGCCAGCT CCTATGGATG

(2) INFORMATION FOR SEQ ID NO:97:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:97:

TACGGTGTGW ACTACTCTGC A

(2) INFORMATION FOR SEQ ID NO:98:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 21 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..21

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:

TAC GGT GTG AAC TAC TTT GCA 21
Tyr Gly Val Asn Tyr Phe Ala
1 5
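
The CDS feature of SEQ ID NO:98 pairs a 21-base coding sequence with its seven-residue translation, which SEQ ID NO:99 then lists on its own. The sketch below is an illustration only, not part of the patent: it translates the seven codons with a deliberately partial codon table that covers just the codons appearing in this entry. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Partial standard-genetic-code table: only the codons present in SEQ ID NO:98.
# This is an assumption made for brevity; a full table has 64 entries.
CODON_TABLE = {
    "TAC": "Tyr",
    "GGT": "Gly",
    "GTG": "Val",
    "AAC": "Asn",
    "TTT": "Phe",
    "GCA": "Ala",
}


def translate(cds: str) -> list[str]:
    """Split a coding sequence into codons and map each to a three-letter residue."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # ignore any incomplete trailing codon
    return [CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3)]


# Codons as printed in SEQ ID NO:98.
print(" ".join(translate("TAC GGT GTG AAC TAC TTT GCA")))
```

Because the table is partial, `translate` raises `KeyError` for any codon outside this entry; that is intentional in a sketch meant only to cross-check the printed translation.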

(2) INFORMATION FOR SEQ ID NO:99:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:

Tyr Gly Val Asn Tyr Phe Ala
1 5

(2) INFORMATION FOR SEQ ID NO:100:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:

GGGAAGGACC TCTTT 15

(2) INFORMATION FOR SEQ ID NO:101:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:



(2) INFORMATION FOR SEQ ID NO:102:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 243 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:

AGTGCAGACA AAAGGTTTTA TTAATGATTT TTGCTCACAG TGTCCTTCCC CATTGGTTTG 60
TTATTGCAGA TGAAGTGGAA AGGGAAGGAC CTCTTTGATT TGGTGTGCCG GACTCTGGGG 120
CTCCGAGAAA CCTGGTTCTT TGGACTGCAG TACACAATCA AGGACACAGT GGCCTGGCTC 180
AAAATGGACA AGAAGGTTGG GCTAGAACTC GATGAAACTG GTGGGGCTGA CGTGAGCTTT 240
CCA 243
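
Comparing the printed sequences suggests that SEQ ID NO:54 is the reverse complement of a stretch near the 3' end of SEQ ID NO:102; this is an observation from the listing text, not a statement made in this section. A minimal sketch of how that can be checked follows. The helper names `clean` and `reverse_complement` are hypothetical, and only the fourth printed line of SEQ ID NO:102 is used as the search region.

```python
# Translation table for the four unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(listing_text: str) -> str:
    """Keep only A, C, G and T; drops block spacing and running counts."""
    return "".join(ch for ch in listing_text.upper() if ch in "ACGT")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]


# Bases 181-240 of SEQ ID NO:102 as printed in the listing.
region = clean("AAAATGGACA AGAAGGTTGG GCTAGAACTC GATGAAACTG GTGGGGCTGA CGTGAGCTTT 240")
# SEQ ID NO:54 as printed in the listing.
primer_54 = clean("CAGTTTCATC GAGTTCTAGC C 21")

print(reverse_complement(primer_54) in region)
```

The same check can be repeated for other entries, bearing in mind that a failed match may reflect a misread base in the scanned listing rather than a real difference between the sequences.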
(2) INFORMATION FOR SEQ ID NO:103:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 275 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:

GCTTGTTTGA GGGTAGCACA GGAGGAAGTG CCAATATANN GTGTGTTTGT CTTTTCCTCT 60
GCAATTCTCC AGGTACTGGA TCATGATGTT TCAAAGGAAG AACCAGTCAC CTTTCACTTC 120
TTGGCCAAAT TTTATCCTGA GAATGCTGAA GAGGAGCTGG TTCAGGAGAT CACACAACAT 180
TTATTCTTCT TACAGGTACA TCAGTCAAGG CTACCCCCCA GTTCTGAGAG AGAACTTGCC 240
CAGGAGTGGT TGCAGAGTTG GCCTCAGAGT TGACC 275

(2) INFORMATION FOR SEQ ID NO:104:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 236 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:

GCTAAAGGGC TCAGAGTGCA GGCCGTGGGG CGCCAGGGTC CCGGSCCTGA GCCCCGCGCC 60
ATGGCCGGGG CCATCGCTTC CCGCATGAGC TTCAGCTCTC TCAAGAGGAA GCAACCCAAG 120
ACGTTCACCG TGAGGATCGT CACCATGGAC GCCGAGATGG AGTTCAATTG CGAGGTAACC 180
GGCCGGCAGC CCCGACTGCT GCGGTGACAG TCGAGGTGGA AGCTCGAGAG GTTCTC 236
(2) INFORMATION FOR SEQ ID NO:105:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 188 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:

CCTCACTTCC CCTCACAGAG TATCATGTCT CCCTTGTTGC TCCTTTCAGG TAAAGAAGCA 60
GATTTTAGAT GAAAAGATCT ACTGCCCTCC TGAGGCTTCT GTGCTCCTGG CTTCTTACGC 120
CGTCCAGGCC AAGCTAGGCT CAAAGAAGAA AAATGTATTT TT>7NCTGCGC GTTAATTTGG 180
GTCATGCG 188

(2) INFORMATION FOR SEQ ID NO:106:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 200 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:

TGGCAGTTAT CTTTAGAATC TCAATCGCCT GCTCTCCCTT TCTTCTTTCC AGTATGGTGA 60
CTACGACCCC AGTGTTCACA AGCGGGGATT TTTGGCCCAA GAGGAATTGC TTCCAAAAAG 120
GGTAAGAGAT TAAATTCCCT TTTCAGGAAG ACATAGCAGA TATGTGGTCT AAAAGAAAGC 180
TAACCAAACG ACTTGAAGGA 200
(2) INFORMATION FOR SEQ ID NO:107:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 256 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107:

TCTGTGTGAC TACTCCTGGT GTAGCTTTAA AATAGCTTTA CTGTTTGTAA AATGATGCAT 60
AATTATAAAA GTGGCAAACA ATACCAAATT TACTTCATCT GTACGTTTTT TATTTTCCTC 120
TATTTTTTGG TAGGTAATAA ATCTGTATCA GATGACTCCG GAAATGTGGG AGGAGAGAAT 180
TACTGCTTGG TACGCACAGC ACCGAGGCCG AGCCAGGTGA GGCCCATTCA TTGTTGGTTT 240
ACATTCCTTT ATGGGC 256

(2) INFORMATION FOR SEQ ID NO:108:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 240 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:

GAATGCTTGA TTTCGTGGCC CACCCGCTCT CCACCCATCT CACTTAGCTC CAATGACAGT 60
GTCTTCCGTT CTCCCCACAG GGATGAAGCT GAAATGGAAT ATCTGAAGAT AGCTCAGGAC 120
CTGGAGATGT ACGGTGTGAA CTACTTTGCA ATCCGGGTGT GTTGAAACCT CTCTGAGCTC 180
CTTGTGTAGT AGACAGAGAC TGAGTGAGGG CCAGGACTGC TAAAATGGTT ACTTCTTCAT 240

(2) INFORMATION FOR SEQ ID NO:109:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 387 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109:

TCTGTGGACC TGCTGAACTG CACATGTGAC AGTGTGTGCC AGATTCTTTG GAAGCTTCAA 60
TAAAATTTTG AGCCTCAGCT GGCGCTTACA GTAGCTGTTC TTATTGGATC CACAGAATAA 120
AAAGGGCACA GAGCTGCTGC TTGGAGTGGA TGCCCTGGGG CTTCACATTT ATGACCCTGA 180
.AACAGACTG ACCCCCAAGA TCTCCTTCCC GTGGAATGAA ATCCGAAACA TCTCGTACAG 240
TGACAAGGAG GTAGGACATG TGTGTACTGC AGATGGGTCC AGCAGATCTT TCCCTGTCTG 300
CCCCCCTCAC TGGAGCCTCC CCAGCCAGGG CATCTCCTTG TTATTCATAG AGTCCTTTAA 360
TTCCCAGGCT TTGAGGGTGT GGTTGTT 387

(2) INFORMATION FOR SEQ ID NO:110:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 300 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:110:

GACTTGGTGC TCCTAATTCC CTGAGGTTTA GTGCCTGGAT ACTGGCAAGC CAGNACAAGG 60
GCATAACNTC ATGCTGGTCT GTGGCCAGTG TCGTTGCGCA TTTGTGGAAT TNCCAATTGC 120
TGGTAACATT CCAGGCTGTC GGACTGAAAC TGTGTTCTGC TTGATTCTTC CAGTTTACTA 180
TTAAACCACT GCATAAGAAA ATTGATGTCT TCAACTTTAA CTCCTCAAAG CTTCGTGTTA 240
ATAACCTCGT AAGTTGACAT CCTGGTAAGT TGAGATCCTG GTTTTCATTA CTGATAATGG 300

(2) INFORMATION FOR SEQ ID NO:111:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 260 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:111:

TGCTACCTGC AAGAGCTCAA ACTGCTATCC CACTAGTGGG CCAGTAGCCA GTGAAGTAAA 60
TTTGTGGATA TTAACCTTTT TGTCTGCTTC TGTGGCCACA GATTCTCCAG CTATGTATCG 120
GGAACCATGA TCTATTTATG AGGAGAAGGA AAGCCGATTC TTTGGAAGTT CAGCAGATGA 180
AAGCCCAGGC CAGGGAGGAG AAGGCTAGAA AGCAGGTGAG CACAACCTTG TTTTAACTGA 240
TGATGTCACT GTGTGGTCAG 260
(2) INFORMATION FOR SEQ ID NO:112:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 292 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:

TCTTTGGCCC TTGTGGCACC CTAGGTCTCG AGCCCTGTGA TTCAATGACT GTTTTTCTTC 60
ACCCCTCGCA GATGGAGCGG CAGCGCCTCG CTCGAGAGAA GCAGATGAGG GAGGAGGCTG 120
AACGCACGAG GGATGAGTTG GAGAGGAGGC TGCTGCAGAT GAAAGAAGAA GCAACAATGG 180
CCAACGAAGC ACTGGTGATT TCTGAGGGGC TGGGGTTCCA CGAGGCTACT TGGGGACTTC 240
CTTGGCTTTT CTGGAGCTTG GTCTCCTGAA AACATGAGTT AGCAGCGTTT GC 292

(2) INFORMATION FOR SEQ ID NO:113:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 365 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: both
(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113:

CGGGAGAACA GCACATGATC CCACTTCAGC TAAGAGCACT GTGCCCTCCA GATGCGGTCT 60
GAGGAGAGAG CTGACCTGTT GGCTGAAAAG GCCCAGATCA CCGAGGAGGA GCCAAAACTT 120
CTGGCCCAGA AGGCCGCAGA GGCTGAGCAG GAAATGCAGC GCATCAAGGC CACAGCGATT 180
CGCACGGAGG AGGAGAACCG CCTGATGGAG CAGAAGGTGC TGGAAGCCGA GGTGCTGGCA 240
CTGAAGATGG CTGAGGAGTC AGAGAGGAGG TGAGGGGGCA CCGGGCACCA GACTGGCGAG 300
GAGCCTGGCC AAGGGCCGCA GACCAGCCTG CCCTGAGGCT GAGCTCTACA GCAGTTGTCC 360
TCAAG 365

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

GGTGTCTTTT CCTGCTACCT GCCCTCTTCT GTGAAGCTGA CATCTCATCC TTTCCTTGCA 60

GGGCCAAAGA GGCAGATCAG CTGAAGCAGC ACCTCCAGCA ACCACGCGAG GCGGAGCGAA 120

GAGCCAAGCA GAAGCTCCTG GAGATTGCCA CCAAGCCCAC GTACCCGGTG AGCCTGGGGG 180

CCACCAGCTG GGGCTGCCTT AGTCCTGGTG ATGTTCTCTT TCCTCCC 227
(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:

TGTGCCATTG CCTCTCTGGC TGCTGGAGGA TCGGTTGTCA ACACAGTAGT GTCCTTCTGT 60

CCTTGTATGA CCCAAGCTCC TAATCCGAAA TTTCTCATTA ACAGCCCATG AACCCAATTC 120

CAGCACCGTT GCCTCCTGAC ATACCAAGCT TCAACCTCAT TGGTGACAGC CTGTCTTTCG 180

ACTTCAAAGA TACTGACATG AAGCGGCTTT CCATGGAGAT AGAGAAAGAA AAGTATGTAG 240

CCCCCTGTGC CCTGCTGTGG GCTTGCTGTG AACTAGACTG A 281

(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 335 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

TCGCCAAGTA GAGACGTGAN NCCACCNTNA AACCCTAGAT CGCACACCAA GCAGCTTGTG 60

GGCCACAGAG CACCTGAGCC GTGTCTCACT GTCTGCCCAA CCCCTGATGC ATGATACCCT 120

CTTGCCGGCA GAGTGGAATA CATGGAAAAG AGCAAGCATC TGCAGGAGCA GCTCAATGAA 180

CTCAAGACAG AAATCGAGGC CTTGAAACTG AAAGAGAGGG AGACAGCTCT GGATATTCTG 240

CACAATGAGA ACTCCGACAG ZGGTCGCkGC AGCAAGCACA ATACCATTAA AAAGGTACCC 300

AGGGTCTCTT TCTTGTATTT TGCTGATCAG GACCA 335

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 254 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

CAAACAAAAT CACTCATCAC GATNTCAGGC CTATCCAAGC ATTTTGCANA TGGCACTTAT 60

GGCATTGTTG ATATCACAGG GTATGTTTTT GTTTTTCTTC ATTTTATTTT GCTGGTTTAG 120

CCTCAAGCCC AAGGCAGAAC ACCTATCTGC ATTTGAGCCC TCAAAGTAGC TTGTTCCCAG 180

GTACTCTCTA TGTGGTGATG GTGCTGCCCT CTGTGATACT AACCCGTGCA TGAGNTTGCC 240

TGTCTCTGTC TCGG 254
(2) INFORMATION FOR SEQ ID NO: 118:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: both

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:

AGGACCCTGT GAGACAGAGC GGAGGTCNNG TGCCCTCTCA GCTTCTTCTC TGCTTTCTTA 60

CAGCTCACCT TGCAGAGCGC CAAGTCCCGA GTGGCCTTCT TTGAAGAGCT CTAGCAGGTG 120

ACCCAGCCAC CCCAGGACCT GCCACTTCTC CTGCTACCGG GACCGCGGGA TGGACCAGAT 180

ATCAAGAGAG CCATCCATAG GGAGCTGGCT GGGGGTTTCC GTGGGAGCTC CAGAACTTTC 240

CCCAGCTGAG TGAAGAGCCC AGCCCCTCTT ATGTGCAATT CCCTTGAACT ACGACCCTGT 300

AGAGATTTCT CTCATGGCGT TCTAGTTCTC TGACCTGAG 339



Claims 

1. An isolated, purified or recombinant nucleic acid sequence comprising:

(a) a merlin structural gene as shown in Figure 3 or a functional derivative thereof or its complementary strand;

(b) a sequence homologous to, or which hybridises to either sequence in (a) under stringent conditions; or

(c) is homologous to, or would hybridise under stringent conditions to, a sequence in (a) or (b) but for the
degeneracy of the genetic code; or

(d) an oligonucleotide specific for any of (a), (b) and (c).

2. An isolated, purified or recombinant nucleic acid sequence comprising:

(a) a merlin gene containing the trapped exons as shown in Figure 5;

(b) a sequence homologous to, or which hybridises to either sequence in (a) under stringent conditions; or

(c) is homologous to, or would hybridise under stringent conditions to, a sequence in (a) or (b) but for the
degeneracy of the genetic code; or

(d) an oligonucleotide specific for any of (a), (b) or (c).
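
The "complementary strand" and strand-specific oligonucleotides referred to in claims 1 and 2 can be illustrated with a reverse-complement computation. This is a minimal sketch under stated assumptions: the helper name `reverse_complement` is hypothetical, and the ten-base input is taken from the start of the coding region as printed in FIG.15C of this copy.

```python
# Watson-Crick pairing for the four unambiguous bases (N maps to N).
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


coding_start = "ATGGCCGGGG"  # first ten coding bases as printed in FIG.15C
print(reverse_complement(coding_start))
```

An oligonucleotide with the printed sequence would be expected to anneal to the coding strand at that position; actual probe design would also need to consider length, melting temperature and specificity, none of which are modelled here.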

3. A nucleic acid sequence comprising a mutant merlin gene wherein the mutation in the gene is as shown in Table 3
or in Table 4 in the sequence according to Figure 3.

4. A nucleic acid sequence comprising a mutant merlin gene wherein the mutation in the gene is in a sequence as
defined in claim 1 or claim 2.

5. A nucleic acid sequence comprising a mutant merlin gene wherein there is an A -> T transversion at the first base
of amino acid 220 according to Figure 3.
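
The effect of the transversion in claim 5 can be worked through at the codon level. The sketch below assumes that codon 220 reads AAC (Asn), which is how the Figure 3 numbering appears to align in this copy; the two-entry table is only an excerpt of the standard genetic code, and the helper name is hypothetical.

```python
# Excerpt of the standard genetic code: only the two codons needed here.
CODON_TABLE = {"AAC": "Asn", "TAC": "Tyr"}


def substitute_first_base(codon: str, new_base: str) -> str:
    """Return the codon with its first base replaced by new_base."""
    return new_base + codon[1:]


wild_type = "AAC"  # assumed codon 220, read from Figure 3 of this copy
mutant = substitute_first_base(wild_type, "T")  # the A -> T transversion

print(wild_type, CODON_TABLE[wild_type], "->", mutant, CODON_TABLE[mutant])
```

Under that assumption the change is a missense substitution at residue 220 and not a frameshift or stop codon.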

6. A nucleic acid sequence according to any preceding claim wherein the nucleic acid encoding merlin is operably 
linked to transcriptional and/or translational expression signals. 

7. A vector comprising the nucleic acid sequence of any preceding claim.

8. A host cell transformed or transfected with a vector according to claim 7.

9. A host cell according to claim 8 which is the E. coli cell line deposited under Accession No. ATCC 75509.

10. An isolated, purified or recombinant polypeptide comprising a merlin protein having the amino acid sequence
shown in Figure 3 and/or is the protein expressed by the E. coli cell line deposited under Accession No. ATCC
75509, or a mutant or variant thereof having merlin protein activity.

11. An isolated, purified or recombinant polypeptide comprising a merlin protein having an amino acid sequence as
shown in Figure 7, or a mutant or variant thereof.

12. An anti-merlin antibody raised against a polypeptide defined in claim 10 or claim 11, optionally labelled.

13. A method of detecting a merlin-associated tumour and/or disease or neurofibromatosis-2 (NF2) in a patient
suspected of having or having predisposition to, said disorders, the method comprising detecting the presence of
and/or evaluating the characteristics of merlin DNA or merlin mRNA comprising a nucleic acid sequence as
defined in any one of claims 1 to 5, and/or merlin protein comprising a polypeptide as defined in claim 10 or claim
11, in a sample taken from the patient.

14. A method according to claim 13 which comprises detecting and/or evaluating whether the merlin DNA is deleted,
missing, mutated, aberrant or not expressing normal merlin protein.

15. A method according to claim 13 which comprises: 

(a) taking a biological, tissue or biopsy sample from the patient; 

(b) detecting the presence of and/or evaluating the characteristics of merlin DNA, merlin mRNA and/or merlin 
protein in the sample to obtain a first set of results; 

(c) comparing the first set of results with a second set of results obtained using the same or similar methodol- 
ogy for an individual not suspected of having said disorders; and 

(d) if the first and second set of results differ in that the merlin DNA is deleted, missing, aberrant, mutated or
not expressing merlin protein, this indicates the presence of said disorders or the predisposition or tendency
of the patient to develop said disorders.

16. A method according to claim 13 which comprises:

(a) taking a tissue, biological or biopsy sample from the patient;

(b) detecting the presence of and/or evaluating the characteristics of merlin DNA, merlin mRNA and/or merlin
protein in the sample to obtain a first set of results;

(c) comparing the first set of results with a second set of results obtained using the same or similar
methodology for evaluations in an individual known to have at least one of said disorders; and

(d) if the first and second set of results are identical, this indicates that the merlin DNA in the patient is deleted,
mutated or not expressing normal merlin protein.

17. A method according to either claim 15 or 16 wherein the second set of results is obtained using merlin DNA con- 
taining a mutation at the first position of amino acid 220 according to Figure 3 which is an A -> T transversion. 

18. A method according to any of claims 13 to 17 wherein the detection or evaluation is by a Southern blot procedure
using DNA from the sample digested with the restriction enzyme RsaI or an antibody according to claim 11.

19. A method of characterising a mutation in a subject suspected of having a mutation in the merlin gene comprising
a nucleic acid sequence as defined in any one of claims 1 to 5, the method comprising:

(a) amplifying each of the exons in the merlin gene of the subject;

(b) denaturing the complementary strands of the amplified exons;

(c) diluting the denatured separate, complementary strands to allow each single-stranded DNA molecule to
assume a secondary structure conformation;

(d) subjecting the DNA molecules to electrophoresis under non-denaturing conditions;

(e) comparing the electrophoresis pattern of the single-stranded molecules with the electrophoresis pattern of
single-stranded molecules containing the same amplified exon from a control individual which has either a
normal or NF2 heterozygous genotype; and

(f) sequencing any amplification product which has an electrophoretic pattern different from the pattern
obtained from the DNA of the control individual.
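
Steps (e) and (f) of claim 19 amount to a per-exon decision rule: compare the single-strand band pattern with the control and sequence only the exons that differ. The following is a minimal sketch of that rule, assuming band patterns have been reduced to lists of migration distances; the function name, the tolerance value and all data are hypothetical and do not come from the patent.

```python
def exons_to_sequence(sample, control, tolerance=0.5):
    """Return the exons whose band pattern differs from the control.

    sample and control map an exon label to a list of band migration
    distances (arbitrary units). tolerance is an assumed allowance for
    gel-to-gel variation.
    """
    flagged = []
    for exon, bands in sample.items():
        observed = sorted(bands)
        reference = sorted(control.get(exon, []))
        same_pattern = len(observed) == len(reference) and all(
            abs(o - r) <= tolerance for o, r in zip(observed, reference)
        )
        if not same_pattern:
            flagged.append(exon)  # step (f): candidate for sequencing
    return flagged


# Hypothetical data: exon 2 shows an extra band relative to the control.
sample_bands = {"exon 1": [12.0, 14.5], "exon 2": [10.0, 13.0, 15.5]}
control_bands = {"exon 1": [12.1, 14.4], "exon 2": [10.0, 13.1]}
print(exons_to_sequence(sample_bands, control_bands))
```

In practice the comparison is made visually on the gel; the numeric tolerance here only stands in for that judgement.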

20. A vector as defined in claim 6, or a nucleic acid sequence as defined in any one of claims 1 to 3, encoding a func- 
tional merlin protein, or a functional merlin polypeptide as defined in claim 9, for use in gene therapy. 

21. The use of a vector as defined in claim 7, or a nucleic acid sequence as defined in any one of claims 1 to 5,
encoding a functional merlin protein, or a functional merlin polypeptide as defined in claim 10, in the preparation of an
agent for the treatment or prevention of a merlin-associated tumour or disease or neurofibromatosis-2 (NF2).

22. The use according to claim 21 wherein the treatment or prevention is for schwannoma or meningioma (either being,
for example, vestibular, cranial nerve or spinal nerve root), posterior capsular lens opacities, deafness or hearing
loss, balance disorders or paralysis.

23. A pharmaceutical composition, suitably adapted for injection, comprising a vector as defined in claim 7, or a nucleic
acid as defined in any one of claims 1 to 5, encoding a functional merlin protein, a functional merlin polypeptide as
defined in claim 10 or host as defined in claim 8 expressing a functional protein, or a combination thereof, and a
pharmaceutically acceptable carrier.

Patentansprüche

(b) einer Sequenz, die zu einer Sequenz in (a) homolog ist oder damit unter stringenten Bedingungen
hybridisiert; oder

(c) die zu einer Sequenz in (a) oder (b) homolog ist oder ohne die Degeneriertheit des genetischen Codes
unter stringenten Bedingungen damit hybridisieren würde; oder

(d) einem für (a), (b) oder (c) spezifischen Oligonucleotid.

2. Isolierte, aufgereinigte oder rekombinante Nucleinsäuresequenz mit:

(a) einem merlin-Gen mit den in Figur 5 gezeigten gefangenen Exons;

(b) einer Sequenz, die zu einer Sequenz in (a) homolog ist oder damit unter stringenten Bedingungen
hybridisiert; oder

(c) die zu einer Sequenz in (a) oder (b) homolog ist oder ohne die Degeneriertheit des genetischen Codes
unter stringenten Bedingungen damit hybridisieren würde; oder

(d) einem für (a), (b) oder (c) spezifischen Oligonucleotid.

3. Nucleinsäuresequenz mit einem mutierten merlin-Gen, wobei die Mutation des Gens, wie gezeigt in Tabelle 3 oder
Tabelle 4, in der Sequenz gemäß Figur 3 ist.

4. Nucleinsäuresequenz mit einem mutierten merlin-Gen, wobei die Mutation des Gens sich in einer Sequenz nach
Anspruch 1 oder Anspruch 2 befindet.

5. Nucleinsäuresequenz mit einem mutierten merlin-Gen, in dem sich eine A -> T Transversion an der 1. Base der
Aminosäure 220 gemäß Figur 3 befindet.

6. Nucleinsäuresequenz gemäß einem der vorhergehenden Ansprüche, wobei die für merlin kodierende
Nucleinsäure operativ an transkriptionelle und/oder translationelle Expressionssignale gekoppelt ist.

7. Vektor mit einer Nucleinsäuresequenz nach einem der vorhergehenden Ansprüche.

8. Wirtszelle, mit einem Vektor gemäß Anspruch 7 transformiert oder transfiziert.

9. Wirtszelle gemäß Anspruch 8, welche die E. coli-Zellinie ist, die unter Zugangsnummer ATCC 75509 hinterlegt ist.

10. Isoliertes, aufgereinigtes oder rekombinantes Polypeptid, das ein merlin-Protein enthält, welches die in Figur 3
gezeigte Aminosäuresequenz hat und/oder das Protein ist, welches von der E. coli-Zellinie exprimiert wird, die
unter Zugangsnummer ATCC 75509 hinterlegt ist, oder eine Mutante oder eine Variante davon mit
merlin-Proteinaktivität.

11. Isoliertes, aufgereinigtes oder rekombinantes Polypeptid, mit einem merlin-Protein, welches eine
Aminosäuresequenz gemäß Figur 7 hat, oder eine Mutante oder Variante davon.

12. Anti-merlin-Antikörper, der gegen ein in Anspruch 10 oder Anspruch 11 definiertes Polypeptid gebildet wurde.

13. Verfahren zur Detektion eines mer//n-assoziierten Tumors und/oder einer me^/n-assoziierten Krankheit oder Neu- 
so rofibrornatose-2 (NF2) in einem Patienten mit Verdacht auf diese StOrungen oder mit Verdacht auf die Pradisposi- 

tion fur diese StOrungen durch die Detektion der Gegenwart und/oder die Bewertung der Eigenschaften von 
merlin-DMA oder merHn-mRUA mit emer in einem der Anspruche 1 bis 5 definierten Nucleinsauresequenz 
und/oder von merlin- Protein mit einem in einem der Anspruche 10 oder 11 definierten Polypeptid in etner dem Pati- 
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15. Vertahren gem£f3 Anspruch 13 mit: 

(a) der Entnahme einer biologischen, Gewebe- oder Biopsieprobe vom Patienten; 

(b) der Detektion der Gegenwart und/oder der Bewertung der Eigenschaften von meW/n-DNA, me/V/n-mRNA 
und/oder merlin- Protein in der Probe zurn Erhaft eines 1. Satzes von Ergebnissen, 

(c) dem Vergleich des 1 . Satzes von Ergebnissen mit einem 2 Satz von Ergebnissen, der durch Verwendung 
derselben oder einer Shnlichen Methodologie fur ein Individuum erhaiten wurde. welches nicht im Verdacht 
stent, besagte StOrungen zu haben; wobei 

(d) falls der 1 . und 2. Satz von Ergebnissen sjch darin unterscheiden, daft die merlin-QNA deletiert ist, fehlt, 
aberrant ist, mutiert ist Oder kein me riin -Protein exprimiert dies die Gegenwart dieser Stdrungen oder die Pre- 
disposition oder die Tendenz des Patienten anzeigt, diese StOrungen zu entwickeln. 

16. Vertahren gemdf3 Anspruch 13 mit: 

(a) der Entnahme einer Gewebe-, biologischen- oder Biopsieprobe von dem Patienten; 

(b) der Detektion der Gegenwart und/oder dem Bewerten der Eigenschaften von me/7/n-DNA, me/7/n-mRNA 
und/oder menin- Protein in der Probe urn einen 1 . Sau von Ergebnissen erhaiten; 

(c) dem Vergleich des 1. Satzes von Ergebnissen mit einem 2 Satz von Ergebnissen, der durch Verwendung 
derselben oder ahnlicher Methodologie fur Bewertungen in einem Individuum erhaiten wurde, von dem 
bekannt ist, daB es mindestens eine der genannten St6rungen aufwetst; wobei 

(d) falte der 1 . und 2. Satz von Ergebnissen identisch sind, dies anzeigt, daB die meriin-ONA in dem Patienten 
deletiert oder mutiert ist Oder kein normales mer//n-Protein exprimiert. 

17. Vertahren nach einem der Anspruche 15 oder 16, wobei der 2. Satz von Ergebnissen durch die Verwendung einer 
merlin -ON A erhaiten wird, die eine Mutation an der 1 . Position von Aminosdure 220 gemaB Figur 3 enthait, welche 
eine A -> T Transversion ist. 

18. Vertahren gem&3 einem der Anspruche 13 bis 17, wobei die Detektion oder Bewertung durch ein Southern Bot 
Vertahren urrter Verwendung von mit dem Restriktionsenzym Rsal verdauter DNA aus der Probe oder durch einem 
Antik6rper gemaft Anspruch 1 1 vorgenommen wird. 

19. Vertahren zur Charakterisierung einer Mutation in einem Wesen, welches unter Verdacht stent, eine Mutation mit 
einer Nucleinsfluresequenz gemSB einem der Anspruche 1 bis 5 in dem merlin-Gen zu haben durch: 

(a) Amplif ikation jedes der Exons im merlin-Gen des Wesens; 

(b) Denaturierung der Komplementarstrange der amplifizierten Exons; 

(c) Verdunnung der denaturierten, separaten Komplementarstrange. urn es jedem einzelstr&ngigen DNA- 
Molekul zu erlauben, eine Sekundarstrukturkonformation einzunehmen. 

(d) Elektrophorese der DNA-Molekule unter nicht-denaturierenden Bedingungen; 

(e) Vergleich des Elektrophoresemusters der einzeistrangigen Molekule mit dem Elektrophoresemuster von 
emzelstrangigen Molekulen, die dasselbe amplrfizierte Exon von einem Kontroliindividium enthalten, welches 
entweder etnen normalen oder einen heterozygoten NF2-Genotyp hat; 
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kodiert. zur Verwendung in der Gentherapie. 

21. Verwendung eines Vektors, wie in Anspruch 7 definiert Oder einer Nucleins&uresequenz, wie in einem der Anspru- 
che i bis 5 definiert, der Oder die fur ein funktionelles mer//n-Protein Oder mer/^-Polypeptid wie in Anspruch 10 
definiert, kodiert, zur Herstellung etnes Mittels fur die Behandlung oder Vorbeugung eines mer///>assozierten 
Tumors Oder einer me/7/n-assozierten Krankheit Oder Neurofibromatose-2(NF2). 

22. Verwendung gemafB Anspruch 21 , wobei die Behandlung oder Vorbeugung Schwannome oder Meningiome (bei- 
spieisweise vestibular©, von Himnerven oder Spinalnervenwurzein stammende), posteriore kapsuiare Linsentru- 
bungen, Taubheit Oder Verlust des Horvermogens, GleichgewichtsstOrungen oder Paralyse behilft. 

23. Pharmazeutische Zusammensetzung, geeignet fur die Injektion, mit einem Vektor wie in Anspruch 7 definiert, oder 
einer Nucleins£ure, wie in einem der Anspruche 1 bis 5 definiert, der oder die fur ein funktionelles merlin- Protein 
oder ein funktionelles mer^n-Polypeptid, wie in Anspruch 10 definiert, kodiert Oder einem Wirt, wie in Anspruch 8 
definiert. der ein funktionefies Protein exprimiert oder eine Kombination davon und einem pharmazeutisch akzep- 
tablen Trager. 

Revendications

1. Séquence d'acide nucléique, isolée, purifiée ou recombinante, comprenant :

(a) un gène de structure merlin tel que représenté sur la Figure 3 ou un dérivé fonctionnel de celui-ci ou son
brin complémentaire ;

(b) une séquence homologue de, ou qui s'hybride à, l'une ou l'autre des séquences indiquées en (a) dans des
conditions stringentes ; ou

(c) est homologue de, ou s'hybriderait dans des conditions stringentes à, une séquence indiquée en (a) ou (b)
mais pour la dégénérescence du code génétique ; ou

(d) un oligonucléotide spécifique pour l'un quelconque parmi (a), (b) et (c).

2. Sequence d acide nucleique, isolee, purrfiee ou recombinante, comprenant : 

(a) un gene merlin contenant les exons pieges tels que represents sur la Figure 5 ; 

(b) une sequence homologue de, ou qui s'hybride a, Tune ou I'autre des sequences indiquees en (a) dans des 
conditions stringentes ; ou 

(c) est homologue de, ou s'hybriderart dans des conditions stringentes a, une sequence indiquee en (a) ou (b) 
mais pour la degenerescence du code genetique ; ou 

(d) un oligonucleotide specrfique pour I'un quelconque parmi (a), (b) ou (c). 

3. Sequence dacide nucleique comprenant un gene merlin mutant, dans taquelle la mutation dans le gene est telle 
que representee dans le Tableau 3 ou le Tableau 4 dans ia sequence selon la Figure 3. 

4. Sequence d acide nucleique comprenant un gene merlin mutant, dans laquelle la mutation dans le g£ne est dans 
une sequence telle que delinie a la revendication 1 ou la revendication 2. 

5. Sequence d'acide nucleique comprenant un gene merlin mutant, dans laquelle il y a une tranversion A -> T au 
niveau de la premiere base de I'acide amine 220 selon la Figure 3. 

c 'v*- o>- : --> t- ^rH^in- p cp'on i''inp nup'ronrn jp dp^ 'pvencirations or^c^dpntps riars laoueile I'acide nuclei - 
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8. Cellule h6te transform^ ou transfectee par un vecteur tel que defmi k la revendication 7. 

9. Cellule hole selon la revendication 8, qui est la lignee cellulaire d'E cod deposee sous te No. d'acc&s ATCC 75509 

5 10. Polypeptide isoie, purilie ou recombinant, comprenant une protetne merlin ayant la sequence d ecides amines 
representee sur la Figure 3 et/ou qui est la proline exprimee par la lignee cellulaire d'E. coli deposee sous le No. 
d'acces ATCC 75509, ou un mutant ou variant de celle-ci ayant une activite de proteine merlin. 

11. Polypeptide isoie, purifie ou recombinant, comprenant une proteine merlin ayant une sequence d'acdes amines 
ic telle que representee sur la Figure 7, ou un mutant ou variant de celle-ci. 

12. Anticorps anti- merlin dirige contre un polypeptide defini k la revendication 10 ou la revendication 11, facultative- 
ment marque 

is 13. Precede de detection d'une tumeur et/ou maladie associee ci merlin ou de la neurofibromatose-2 (NF2) chez un 
patient suspecte d'avoir lesdits, ou ayant une predisposition auxdits. troubles, le proced6 comprenant la detection 
de la presence, et/ou revaluation des caracteristiques, de I'ADN merlin ou de I'ARNm merlin comprenant une 
sequence d acide nucieique telle que detinie k Tune des revendications 1 k 5, et/ou d'une proteine merlin compre- 
nant un polypeptide tel que defini k la revendication 1 0 ou k la revendication 1 1 , dans un echantillon preieve sur le 

20 patient. 

14. Precede selon la revendication 13, qui comprend la detection et/ou revaluation du fait de savoir si i'ADN merlin est 
supprime, est manquant, a mute, est aberrant ou n'exprime pas la proteine merlin normal e 

25 15. Precede selon la revendication 13, qui comprend les operations consistant k : 

(a) prelever un echantillon de tissu, biologique ou de btopsie chez un patient ; 

(b) detecter la presence, et/ou evaluer les caracteristiques, de I'ADN merlin, de I'ARNm merlin et/ou de la pro- 
30 teme merlin dans rechantillon, afin d'obtenir un premier ensemble de resurtats ; 

(c) comparer le premier ensemble de resurtats avec un second ensemble de resurtats obtenu k I'atde de la 
m§me methodologie ou d une methodologie analogue chez un individu non suspecte d'avoir de tels troubles ; 
et 

35 

(d) si les premier et second ensembles de resultats different par le fait que I'ADN merlin comporte une deletion, 
est manquant, est aberrant, a mute ou n'exprime pas la proteine merlin, ceci indique ators la presence, ta pre- 
disposition ou la tendance du patient k developper lesdits troubles. 

4c 16. Precede selon la revendication 13, qui comprend les operations consistant k : 

(a) prelever un echantillon de tissu, biologique ou de biopsie sur un patient ; 

(b) detecter la presence, et/ou evaluer les caracteristiques, de I'ADN merlin, de I'ARNm merlin et/ou de la pro- 
45 teme merlin dans I'echantillon. afin d'obtenir un premier ensemble de resultats ; 

(c) comparer ie premier ensemble de resultats avec un second ensemble de resultats obtenu k ( aide de la 
meme methodologie ou d'une methodologie analogue pour des evaluations chez un individu connu pour avoir 
au moins I'un desdits troubles ; et 

(d) si les premier et second ensembles de resultats sont identiques, ceci indique que I'ADN merlin chez le 
patient comporte une deletion, a mute ou n'exprime pas la proteine merlin normale. 

1 t p-^^Hp spin^ irp c^s rpvpnoVpitionc -c pf 1^ dRH9 ipqupl te ^^onri prspmblp dp resu'tats ^st obtenu a I'aide 
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operatoire de Southern-blot a ('aide d'ADN provenant de rechantillon digere par r enzyme de restriction Rsa\ ou un 
anticorps seion la revendication 12. 

19. Proc6de de caracterisation d'une mutation chez un sujet suspects d'avoir une mutation dans le g£ne merlin com- 
s prenant une sequence d'acide nudeique telle que def mte k Tune quelconque des revendications 1 a 5, le precede 

comprenant les operations consistant k : 

(a) amplifier chacun des exons dans le g£ne merlin du sujet , 

:o (b) denaturer les brins compJementaires des exons amplifies ; 

(c) diluer les brins complemerrtaires separes, denatures, pour permettre a chaque molecule d'ADN monocate- 
naire dadopter une conformation de structure secondaire ; 

is (d) soumettre les molecules d'ADN a une electrophorese dans des conditions non denaturantes ; 

(e) comparer le diagramme d' Electrophorese des molecules monocat6n aires avec le diagramme d'eiectropho- 
r£se de molecules monocatenaires contenant le meme exon amplifie provenant d'un individu temoin qui a un 
genotype soit normal soit heterozygote pour NF2 ; et 

20 

/4\ ,^r>^rsr i^i >♦ r-trr^Hi ii+ r4 y n rr\r\\i4\^Vir\n rti li a i in Hianrommo £ I art rnr.hr> rati n i to Hifferont Hit Hiflfirammfi fiht Aril J 

a partir de I'ADN de I'individu temoin. 

20. Vecteur tel que defini k la revendication 7, ou sequence d'acide nucieique telle que def inie a I une des revendica- 
25 tions 1 a 3, codant pour une proline merlin fonctionnelle ou un polypeptide merlin fonctionnel tel que defini a la 

revendication 10, pour une utilisation en therapie genique. 

21. Utilisation d'un vecteur tel que defini k la revendication 7, ou d'une sequence d'acide nucieique telle que definte k 
Tune des revendications 1 k 5. codant pour une proteine merlin fonctionnelle ou un polypeptide merlin fonctionnel 

30 tel que d^fint k la revendication 1 0, dans la preparation d'un agent pour le traitement ou la prevention d'une tumeur 
ou maladie associee a merlin ou de la neurofibromatose-2 (NF2) 

22. Utilisation selon la revendication 21. dans laquelle le traitement ou la prevention est pour le schwannome ou le 
meningiome (Tune ou Tautre etant, par exemple, vestibulaire, des nerfs cr&niens ou de la racine des nerfs rachi- 

35 diens), les opacit6s du cristallin capsulaire posterieur, la surdite ou la perte de I'audition, les troubles de Pdquilibre 
ou la paralysie. 

23. Composition pharmaceutique, adapt6e de facon appropriee pour ('injection, comprenant un vecteur tel que d6f ini 
k la revendication 7, ou un acide nucieique tel que defini k I'unedes revendications 1 a 5, codant pour une proteine 

40 merlin fonctionnelle, un polypeptide merlin fonctionnel tel que defini a la revendication 1 0 ou un hcMe tei que defini 
k la revendication 8 exprimant une proteine fonctionnelle, ou une combinaison de ceux-ci, et un support pharma- 
ceutique me nt acceptable. 
ACGCCAGCCG TCAGGGACCG TCCCCCAACT CCCCTTTCCG CTCAGGCAGG GTCCTCGCGG 60
CCCATGCTGG CCGCTGCGGA CCCGCGCAGC CCACACCGTT CCCGGGCCGG CCAGCCGGCA 120
CCATCGTGGC CCTGAGGCCT GTGCAGCAAC TCCAGGGGGG CTAAAGGCCT CAGAGTGCAC 180
GCCGTGCGGC GCGAGGGTCC CGGGCCTGAG CCCCGCGCC ATG GCC GGG GCC ATC 234

CTT TCA AAG CAA GAA CCA GTC ACC TTT CAC TTC TTG GCC AAA TTT TAT 522

Val Ser Lys Glu Glu Pro Val Thr Phe His Phe Leu Ala Lys Phe Tyr
90 95 100

CCT GAG AAT GCT GAA GAG GAG CTC CTT CAG GAG ATC ACA CAA CAT TTA 570

Pro Glu Asn Ala Glu Glu Glu Leu Val Gln Glu Ile Thr Gln His Leu

105 110 115

TTC TTC TTA CAG GTA AAG AAG CAG ATT TTA GAT GAA AAG ATC TAC TGC 618

Phe Phe Leu Gln Val Lys Lys Gln Ile Leu Asp Glu Lys Ile Tyr Cys

120 125 130

CCT CCT GAG GCT TCT GTG CTC CTG CCT TCT TAC GCC GTC CAG GCC AAG 666

Pro Pro Glu Ala Ser Val Leu Leu Ala Ser Tyr Ala Val Gln Ala Lys

135 140 145

TAT GGT GAC TAC GAC CCC ACT GTT CAC AAG CGG GCA TTT TTG GCC CAA 714

Tyr Gly Asp Tyr Asp Pro Ser Val His Lys Arg Gly Phe Leu Ala Gln

150 155 160 165

GAG GAA TTG CTT CCA AAA AGG GTA ATA AAT CTG TAT CAG ATG ACT CCG 762

Glu Glu Leu Leu Pro Lys Arg Val Ile Asn Leu Tyr Gln Met Thr Pro
170 175 180

GAA ATG TGG GAG GAG ACA ATT ACT GCT TGC TAC GCA GAG CAC CGA GGC 810

Glu Met Trp Glu Glu Arg Ile Thr Ala Trp Tyr Ala Glu His Arg Gly

185 190 195

CGA GCC AGG GAT GAA GCT GAA ATG GAA TAT CTG AAG ATA GCT CAG GAC 858

Arg Ala Arg Asp Glu Ala Glu Met Glu Tyr Leu Lys Ile Ala Gln Asp

200 205 210
CTG GAG ATG TAC GGT GTG AAC TAC TTT GCA ATC CGG AAT AAA AAG GGC 906

Leu Glu Met Tyr Gly Val Asn Tyr Phe Ala Ile Arg Asn Lys Lys Gly
215 220 225

ACA GAG CTG CTG CTT GCA GTG GAT GCC CTG GGG CTT CAC ATT TAT GAC 954

Thr Glu Leu Leu Leu Gly Val Asp Ala Leu Gly Leu His Ile Tyr Asp
230 235 240 245

CCT GAG AAC AGA CTG ACC CCC AAG ATC TCC TTC CCG TGG AAT CAA ATC 1002

Pro Glu Asn Arg Leu Thr Pro Lys Ile Ser Phe Pro Trp Asn Glu Ile
250 255 260

CGA AAC ATC TCG TAC ACT GAC AAG GAG TTT ACT ATT AAA CCA CTG CAT 1050

Arg Asn Ile Ser Tyr Ser Asp Lys Glu Phe Thr Ile Lys Pro Leu Asp
265 270 275

AAG AAA ATT CAT GTC TTC AAG TTT AAC TCC TCA AAG CTT CCT GTT AAT 1098

Lys Lys Ile Asp Val Phe Lys Phe Asn Ser Ser Lys Leu Arg Val Asn

280 285 290

AAG CTG ATT CTC CAG CTA TCT ATC GGG AAC CAT CAT CTA TTT ATG AGG 1146

Lys Leu Ile Leu Gln Leu Cys Ile Gly Asn His Asp Leu Phe Met Arg
295 300 305

AGA AGG AAA GCC GAT TCT TTG GAA GTT CAG CAG ATG AAA GCC CAG GCC 1194

Arg Arg Lys Ala Asp Ser Leu Glu Val Gln Gln Met Lys Ala Gln Ala
310 315 320 325

AGG GAG GAG AAG GCT AGA AAG CAG ATG GAG CGG CAG CGC CTC GCT CGA 1242

Arg Glu Glu Lys Ala Arg Lys Gln Met Glu Arg Gln Arg Leu Ala Arg
330 335 340
CAG AAG CTC CTG GAG ATT CCC ACC AAG CCC ACG TAC CCG CCC ATG AAC 1674

Gln Lys Leu Leu Glu Ile Ala Thr Lys Pro Thr Tyr Pro Pro Met Asn
470 475 480 485

CCA ATT CCA CCA CCG TTG CCT CCT GAC ATA CCA AGC TTC AAC CTC ATT 1722

Pro Ile Pro Ala Pro Leu Pro Pro Asp Ile Pro Ser Phe Asn Leu Ile
490 495 500

GGT GAC AGC CTG TCT TTC GAC TTC AAA GAT ACT GAC ATG AAG CGC CTT 1770

Gly Asp Ser Leu Ser Phe Asp Phe Lys Asp Thr Asp Met Lys Arg Leu
505 510 515

TCC ATG GAG ATA GAG AAA GAA AAA GTG GAA TAC ATG GAA AAG AGC AAG 1818

Ser Met Glu Ile Glu Lys Glu Lys Val Glu Tyr Met Glu Lys Ser Lys
520 525 530

CAT CTG CAG GAG CAG CTC AAT GAA CTC AAG ACA GAA ATC CAC GCC TTG 1866

His Leu Gln Glu Gln Leu Asn Glu Leu Lys Thr Glu Ile Glu Ala Leu
535 540 545

AAA CTG AAA GAG AGG GAG ACA GCT CTG GAT ATT CTG CAC AAT GAG AAC 1914

Lys Leu Lys Glu Arg Glu Thr Ala Leu Asp Ile Leu His Asn Glu Asn
550 555 560 565
TACCAGGTGA CCCAGCCACC CCAGCACCTG CCACTTCTCC TCCTACCGGG ACCGCGGGAT 2064

GGACCAGATA TCAAGAGACC CATCCATAGG GAGCTCCCTG GGGGTTTCCG TGGGAGCTCC 2124

ACAACTTTCC CCAGCTGAGT GAACAGCCCA GCCCCTCTTA TGTGCAATTG CCTTGAACTA 2184

CCACCCTGTA GAGATTTCTC TCATGGCGTT CTAGTTCTCT GACCTGAGTC TTTGTTTTAA 2244

GAAGTATTTG TCT 2257

FIG.3F
AGTGCAGAGA AAAGGTTTTA TTAATGATTT TTGCTCACAG TGTCCTTCCC CATTGGTTTG 60

TTATTGCAcj* TGAAGTGGAA AGGGAAGGAC CTCTTTGATT TGGTGTGCCG GACTCTGGGG 120

CTCCGAGAAA CCTGGTTCTT TGGACTGCAG TACACAATCA AGGACACAGT GGCCTCGCTC 180

AAAATGGACA AGAAG^TTGG GCTACAACTC GATGAAACTG GTGGGGCTGA CGTGACCTTT 240

CCA 243

FIG.15A



GCTTCTTTGA GGGTAGCACA GGAGGAAGTG CCAATATANN GTGTGTTTGT CTTTTGCTCT 60

GCAATTCTGC agJgtactgga TCATGATGTT TCAAAGCAAG AACCAGTCAC CTTTCACTTC 120

TTGGCCAAAT TTTATCCTGA GAATGCTGAA GAGGAGCTGG TTCAGGAGAT CACACAACAT 180

TTATTCTTCT TACAG|j TACA TCAGTCAAGG CTACCCCCCA GTTCTGAGAG AGAACTTGCC 240

CAGGAGTGGT TGCAGAGTTG GCCTCAGAGT TGACC 275

FIG.15B





GCTAAAGGGC TCAGAGTGCA GGCCGTGGGG CGCGAGGGTC CCGGGCCTGA GCCCCGCGCC 60

ATGGCCGGGG CCATCGCTTC CCGCATGAGC TTCAGCTCTC TCAAGAGGAA GCAACCCAAG 120

ACGTTCACCG TGAGGATCGT CACCATGGAC GCCGAGATGG AGTTCAATTG CGAcjsTAACC 180
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CCTCACTTCC CCTCACAGAG TATCATGTCT CCCTTGTTGC TCCTTTCAG^ TAAAGAAGCA 60 

GATTTTAGAT GAAAAGATCT ACTGCCCTCC TGAGGCTTCT GTGCTCCTGG CTTCTTACGC 120 

CGTCCAGGCC AAGfjTAGGCT CAAAGAAGAA AAATGTATTT TTNNCIGGCC GTTAATTTGC 180 

GTCATGGG 188 

FIG.15D 

TGGCAGTTAT CTTTAGAATC TCAATCGCCT GCTCTCCCTT TCTTCTITCC AG|rATGGTGA 60 

CTACGACCCC AGTGTTCACA AGCGGGGATT TTTGGCCCAA GAGGAATTGC TTCCAAAAAG 120 

GfjTAAGAGAT TAAATTCCCT TTTCAGGAAG ACATAGCAGA TATGTGGTCT AAAAGAAAGC 180 

TAACCAAAGG ACTTGAAGGA 200 

FIG.15E 

TCTGTGTGAC TACTCCTGGT GTAGCTTTAA AATAGCTTTA CTGTTTGTAA AATGATGCAT 60 

AATTATAAAA GTGGCAAACA ATACCAAATT TACTTCATCT GTAGGTTTTT TATTTTGCTC 120 

TATTTTTTCC TACjCTAATAA ATCTCTATCA GATGACTCCG GAAATGTGGG AGGAGAGAAT 180 

TACTGCTTGG TACGCAGACC ACCGAGGCCG AGCCAGJsTGA GGCCCATTCA TIGTTGCTTT 240 

ACATTCCTTT ATGGGC 256 
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GAATGCTTGA TTTGGTGGCC CACCCGCTCI CCACCCATCT CACTTACCTC CAATGACAGT 60 

GTCTTCCGTT CTCCCCACAG GGATGAAGCT GAAATGGAAT ATCTGAAGAT AGCTCAGGAC 120

CTGGAGATGT ACGGTGTGAA CTACTTTGCA ATCCG(|GTGT GTTGAAACCT CTCTGAGCTC 180 

CTTGTGTAGT AGACAGAGAC TGAGTGAGGG CCAGGACTGC TAAAATCGTT ACTTCTTCAT 240 



FIG.15G 



TCTGTGGACC TGCTGAACTG CACAIGTGAC AGTGTGTGCC AGATTCTTTG GAAGGTTGAA 60 

TAAAATTTTG AGCCTCAGCT GCCGCTTACA GTACCTGTTC TTATTGGATC CACAGjAATAA 120 

AAAGGGCACA GAGCTGCTGC TTGGAGTGGA TGCCCTGGGG CTTCACATTT ATGACCCTGA 180 

GAACAGACTG ACCCCCAAGA TCTCCTTCCC GTGGAATGAA ATCCGAAACA TCTCGTACAG 240 

TGACAAGGAG GTAGGACATG TGTGTACTGC AGATGGGTCC AGCAGATCTT TCCCTGTCTG 300 

CCCCCCTCAC TGGAGCCTCC CCAGCCAGGG CATCTCCTTG TTATTCATAC AGTCCTTTAA 360 

TTCCCAGGCT TTGAGGGTGT GGTTGTT 387 



FIG.15H 



GACTTGGTGC TCCTAATTCC CTGAGGTTTA GTGCCTGGAT ACTGGGAAGC CAGNACAAGG 60 

GCATAACNTC ATGCTGGTCT GTGGCCAGTG TGGTTGCGCA TTTGTGGAAT TNCCAATTGC 120 

TGGTAACATT CCAGGCTGTC GGACTGAAAC TGTGTTCTGC TTCATTCTTC CAG|TTTACTA 180

TTAAACCACT GGATAAGAAA ATTGATGTCT TCAAGTTTAA CTCCTCAAAG CTTCGTGTTA 240 

FIG.15I
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TGCTACCrGC AAGAGCTCAA ACTGCTATGC CACTAGTGGG CCAGTAGGCA GTGAAGTAAA 60

TTTGTGGATA TTAACCTTTT TGTCTGCTTC TGTCGCCACA gJaTTCTCCAG CTATGTATCG 120

GGAACCATGA TCTATTTATG AGGAGAACGA AAGCCGATTC TTTGGAAGTT CACCAGATGA 180

AAGCCCAGGC CAGGGAGGAG AAGGCTACM ACCAG^TGAG CACAACCTTG TTTTAACTGA 240

TGATGTCACT GTGTGCFCAC 260



FIG.15J 



TCTTTGGCCC TTGTGGCACC CTAGGTCTCG agccctgtga ttcaatgact gtttttcttc 60

ACCCCTCGCA g|mggagcgc cagcgcctcg ctcgagagaa gcagatgagg gaggaggctg 120

AACGCACGAG GGATGAGTTG gagaggaggc tgctgcagat gaaagaagaa gcaacaatgg 180

CCAACGAAGC actgJstgatt tctgaggggc tggggttcca ggaggctact tggggacttc 240

CTTGGCrTTT CTGGAGCTTG GTCTCCTGAA AACATGACT! agcagcgttt gc 292



FIG.15K 



cgcgagaaca gcacatgatc ccacttcagc taagaccact gtcccctcca g^tgcggtct 60

gacgagacag ctgacctctt gccigamag gcccagatca ccgaggacga ggcaaaactt 120

CTCGCCCAGA aggccgcaga ggctgagcag gaaatgcagc ccatcaaggc cacagcgatt 180

cgcacggagg acgagaagcg cctgatggag cagaaggtgc tggaagccga ggtgctggca 240

ctgaagatgg ctgaggagtc agagacgag); tgagggggca ccgggcacca gactggccag 300

gaggctggcc aacggccgca GACCAGCCTG CCCTGAGCCT GAGCTCTACA GCACITGTCC 360



FIG.15L 
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GGTGTCTTTT CCTGCTACCT GCCCTCTTCT GTCAAGC1GA CATCTCATCC TTTCCTTGCA 60 

(jxCCAAAGA GGCAGATCAG CTGAAGCAGG ACCTGCAGGA AGCACGCGAG GCGGAGCGAA 120

GAGCCAAGCA GAAGCTCCTG GAGATTGCCA CCAAGCCCAC CTACCC(|GTC AGCCTGGGGG 180 

CCACCAGCTG GGGCTGCCTT AGTCCTGGTG AIGTTCTCTT TCCTCCC 227 

FIG.15M 



TGTGCCATTG CCTCTGTGGC IWJIUGAGGA ICCCTTCTCA ACACAGTAGT GTCCTTCTCT 60

GCTTGTATGA CCCAAGCTCC TAATCCGAAA TTTCTCATTA ACAcjcCCATG AACCCAATTC 120

CAGCACCGTT GCCTCCTGAC ATACCAAGCT TCAACCTCAT TGGTGACAGC CTGTCTTTCG 180

ACTTCAAAGA TACTGACATG AAGCGGCTTT CCATGGAGAT AGAGAAAGAA A^TAfGTAG 240

CCCCCTGTGC CCTGCTGTGG GCTTGCTGTG AACTAGACTG A 281



FIG.15N 

rGGCCAAGTA GAGACGTGAN NCCACCNTNA AACCCTAGAT CGCACACCAA GCAGCTTGTG 60 

GGCCACAGAG CACCTGAGCC GTGTCTCACT GTCTGCCCAA GCCCTGATGC ATGATACCCT 120 

CTTCCCGGCA G^TGGAATA CATGGAAAAG AGCAAGCATC TCCAGGAGCA CCTCAATGAA 180 

CTCAAGACAG AAATCGAGGC CTTGAAACTG AAAGAGAGGG AGACACCTCT CCATAUCTG 240 

CACAATGAGA ACTCCGACAG GGGIGGCAGC AGCAAGCACA ATACCATTAA AAAgJgTACCC 300 
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CAAACAAAAT CACTCATCAC GATNTCAGGC CTATCCAAGC ATTTTCCANA TGGCACTTAT 60 

GGCATTGTTG ATATCACAGG GTATGTTTTT GTTTTTCTIC ATTTTATTTT GCTGGTTTAG| 120 

CCTCAAGCCC AAGGCAGAAG ACCTATCTCC ATTTGAGCCC TCAA^TAGC TTGTTCCCAG 180 

GTACTCTCTA TGTGGTGATG GTGCTGCCCT CTGTGATACT AACCCGTGCA TGAGNTTCCC 240 

TGTCTCTGTC TCGG 254 

FIG.15P 



AGCACCCTGT GAGACAGAGC GGAGGTCNNG TGCCCTCTCA GCTTCTTCTC TGCTTTCTTA 60 

cagJctcacct TGCAGAGCGC CAAGTCCCGA GTGGCCTTCT TTGAAGACCT CTAGCAGGTG 120 

ACCCAGCCAC CCCAGGACCT GCCACTTCTC CTGCTACCGG GACCGCGGGA TGGACCAGAT 180 

ATCAAGAGAG CCATCCATAG GGAGCTGGCT GGGGGTTTCC GTGGGAGCTC CAGAACTTTC 240 

CCCAGCTGAG TGAAGAGCCC AGCCCCTCTT ATGTGCAATT GCCTTGAACT ACGACCCTGT 300 

AGAGATTTCT CrCATGGCGT TCTAGTTCTC TGACCTGAG 339 



FIG.15Q 
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